SuperCLUE has officially released its "2025 Annual Chinese Large Model Benchmark Evaluation Report." This "all-star game," featuring 23 top domestic and international models, once again reveals new trends in the global AI race. The evaluation covers six core dimensions, including mathematical reasoning, code generation, and scientific reasoning, giving a clear picture of how the major models actually perform in the Chinese language context.


Judging from the overall ranking, overseas closed-source models remain clearly dominant. Anthropic's Claude-Opus-4.5-Reasoning took first place with the highest score of 68.25, followed by Google's Gemini-3-Pro-Preview and OpenAI's GPT-5.2 (high) in second and third. These three giants form the "first tier," maintaining a slight edge in logical rigor and comprehensive understanding.

The real surprise, however, is how quickly domestic large models are closing the gap. The leading open-source model Kimi-K2.5-Thinking and the flagship closed-source model Qwen3-Max-Thinking both entered the global top ten, ranking fourth and sixth respectively. Even more encouraging, domestic models have already achieved breakthroughs in specialized areas: Kimi took first place globally in code generation, while Qwen3 tied Google for the top score in mathematical reasoning.

Taken as a whole, the competitive dynamics in the domestic and overseas markets are starkly different. In the closed-source arena, overseas models currently lead and domestic models are catching up; in the open-source arena, domestic models hold a commanding lead, with the top five domestic open-source models significantly outperforming their overseas counterparts. This parallel advance of open- and closed-source development suggests the Chinese AI ecosystem is entering a period of high-quality growth.

Key points:

  • 🏆 Overseas Giants Lead: Claude-Opus-4.5-Reasoning tops the Chinese large model evaluation with the highest overall score (68.25), and overseas closed-source models still occupy the top three positions.

  • 🚀 Domestic Models Break Through in Specialized Areas: Kimi-K2.5-Thinking takes first place in code generation, while Qwen3-Max-Thinking ties Google for the top score in mathematical reasoning.

  • 📊 Domestic Models Dominate Open Source: In the open-source group, domestic models far outperform their overseas competitors, highlighting the strength of the domestic large model ecosystem in open collaboration.