Ant Group recently announced the open-sourcing of its latest flagship large model, Ling-1T, which has up to 1 trillion parameters and is the largest known base model trained in FP8 low-precision mode. Ling-1T was developed by Ant's "Bailing" team, marking another breakthrough in the team's artificial intelligence work.


According to the team, Ling-1T belongs to the Ling 2.0 model family, which comprises three series: Ling, Ring, and Ming. The Ling series targets general tasks, prioritizing speed and efficiency; the Ring series focuses on deep thinking and complex reasoning; and the Ming series is multimodal, handling a wider range of information types.

Ling-1T has 1 trillion total parameters, but only about 50 billion are activated for each token (roughly a 5% activation ratio), which greatly reduces per-token compute cost. To guide training at this scale, the Ant team proposed the "Ling Scaling Law": after experiments on more than 300 models, they characterized how computational efficiency relates to the expert activation ratio. They also developed a learning-rate scheduler called WSM, which automatically adjusts the learning-rate schedule during training to keep training stable and efficient.
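As a rough illustration of this sparse-activation idea, the sketch below routes a single token to a small top-k subset of experts. Only the 1-trillion-total and ~50-billion-active figures come from the article; the expert count, top-k value, and dimensions are hypothetical and are not taken from Ant's announcement.

```python
# Hedged illustration only: the expert count and top-k below are made up to
# show how a mixture-of-experts model can activate only a few percent of its
# weights per token. Only the total/active parameter counts come from the article.
import numpy as np

TOTAL_PARAMS = 1_000e9   # ~1 trillion total parameters (reported)
ACTIVE_PARAMS = 50e9     # ~50 billion activated per token (reported)
print(f"activation ratio = {ACTIVE_PARAMS / TOTAL_PARAMS:.1%}")  # 5.0%

def topk_route(token_hidden, router_weights, k=2):
    """Toy top-k MoE router: score every expert, but only run the k best.

    token_hidden:   (d_model,) hidden state for one token
    router_weights: (n_experts, d_model) router projection (hypothetical)
    """
    scores = router_weights @ token_hidden                     # one score per expert
    top = np.argsort(scores)[-k:]                              # indices of the k best experts
    gates = np.exp(scores[top]) / np.exp(scores[top]).sum()    # softmax over the selected experts
    return top, gates                                          # only these experts run for this token

# Tiny demo: 64 experts, 2 active per token, so most expert weights stay idle.
rng = np.random.default_rng(0)
hidden = rng.normal(size=128)
router = rng.normal(size=(64, 128))
experts, gates = topk_route(hidden, router)
print("active experts:", experts, "gate weights:", np.round(gates, 3))
```

At this kind of activation ratio, each forward pass touches only a small slice of the total weights, which is how a trillion-parameter model can keep per-token compute closer to that of a much smaller dense model.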

Ling-1T was trained in three stages: pre-training, mid-training, and post-training. In pre-training, the model saw more than 20 trillion tokens of data, including a large amount of reasoning-intensive text. Mid-training further strengthened the model's reasoning abilities, and post-training used "evolutionary chain of thought" self-iteration to improve reasoning accuracy.

In comparisons with other mainstream models, Ling-1T performed well across multiple benchmarks, standing out in mathematical reasoning and code generation. In community testing it also handled complex tasks well, such as simulating physical phenomena and cosmic evolution.

Although Ling-1T demonstrates strong capabilities, it still has some limitations, such as high costs when processing extremely long contexts. The Ant team has stated that they are researching a new hybrid attention architecture to address this issue.

Open-source links (a usage sketch follows below):

HuggingFace: https://huggingface.co/inclusionAI/Ling-1T  

GitHub: https://github.com/inclusionAI/Ling-V2  
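For readers who want to try the released weights, here is a minimal, hedged loading sketch using Hugging Face transformers. The exact flags (trust_remote_code, dtype, device mapping) are assumptions to verify against the model card, and a model of this size realistically requires a multi-GPU or multi-node setup.

```python
# Hedged sketch: loading the released checkpoint with Hugging Face transformers.
# The trust_remote_code, dtype, and device_map settings are assumptions; check
# the model card on Hugging Face before running.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "inclusionAI/Ling-1T"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,   # a custom MoE architecture likely needs this
    torch_dtype="auto",       # let transformers pick the checkpoint precision
    device_map="auto",        # shard across available GPUs (requires accelerate)
)

prompt = "Explain why only a fraction of a mixture-of-experts model runs per token."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```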

Key points:   

🔍 Ling-1T has up to 1 trillion parameters and is the largest known base model trained in FP8 low-precision mode.   

🚀 The model outperforms several mainstream models on mathematical reasoning and code generation benchmarks.   

⚙️ The Ant team is researching a new hybrid attention architecture to reduce Ling-1T's cost when processing extremely long contexts.