Ant Group's BaiLing team has officially announced the open-source release of the latest member of its model family, Ling-2.6-flash. The model ships in multiple precision variants, including BF16, FP8, and INT4, aiming to give developers worldwide more flexible hardware compatibility options and to further lower the barrier to AI deployment.
As a high-performance model, Ling-2.6-flash has 104B total parameters, of which 7.4B are activated. Before release, the model competed anonymously on mainstream international evaluation platforms and, drawing on developer feedback from that period, underwent multiple rounds of deep optimization for Chinese-English switching and code adaptation.
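For orientation, below is a minimal sketch of how one of these precision variants might be loaded with Hugging Face transformers. The repository id `inclusionAI/Ling-2.6-flash` is an illustrative assumption, as is the BF16 setting; the real paths and the names of the FP8/INT4 checkpoints should be taken from the official model cards.

```python
# Minimal sketch: loading a precision variant with Hugging Face transformers.
# The repo id below is a hypothetical placeholder; check the official model
# cards on Hugging Face / ModelScope for the actual paths and variant names.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

REPO = "inclusionAI/Ling-2.6-flash"  # assumed repo id, not confirmed

tokenizer = AutoTokenizer.from_pretrained(REPO, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    REPO,
    torch_dtype=torch.bfloat16,  # BF16 variant; FP8/INT4 would load the
    device_map="auto",           # corresponding quantized checkpoints
    trust_remote_code=True,
)

prompt = "Explain mixture-of-experts models in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```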

Significant Improvement in Inference Efficiency
In terms of technical architecture, Ling-2.6-flash introduces an advanced hybrid linear architecture that substantially improves compute efficiency. Running on mainstream H20 GPUs, it generates up to 340 tokens per second, a throughput well ahead of comparable models in the industry.
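The 340 tokens-per-second figure is Ant Group's own measurement. As a hedged illustration of how decode throughput is commonly measured, the sketch below times generation using the `model` and `tokenizer` objects from the loading example above; it is a generic wall-clock method, not the official benchmark setup.

```python
# Sketch of a crude decode-throughput measurement (tokens per second).
# Wall-clock timing only; a rigorous GPU benchmark would also synchronize
# CUDA and average over warmed-up runs. Not Ant Group's benchmark method.
import time

def decode_throughput(model, tokenizer, prompt: str, max_new_tokens: int = 256) -> float:
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    start = time.perf_counter()
    output = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    elapsed = time.perf_counter() - start
    # Count only newly generated tokens, excluding the prompt.
    generated = output.shape[1] - inputs["input_ids"].shape[1]
    return generated / elapsed

print(f"{decode_throughput(model, tokenizer, 'Write a haiku about GPUs.'):.1f} tok/s")
```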
Beyond raw speed, the model is also remarkably token-efficient. Evaluation data shows that, for tasks of the same complexity, Ling-2.6-flash consumes only one-tenth the tokens of models in its class, effectively reducing long-term operating costs for enterprises.
Enhanced Capabilities for Agent Scenarios
For today's popular agent applications, Ant Group has specifically strengthened the model's capabilities. In both complex tool calling and long-horizon task planning, Ling-2.6-flash demonstrates strong logical execution and high task success rates.
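The announcement does not document the tool-calling interface. As a hedged illustration of what tool calling against such a model typically looks like, here is a sketch assuming the model is served behind an OpenAI-compatible endpoint (for example via vLLM); the endpoint URL, served-model name, and `get_weather` tool are all illustrative assumptions.

```python
# Hypothetical tool-calling request against an OpenAI-compatible server.
# The base_url, model name, and get_weather tool schema are illustrative
# assumptions, not a documented API for Ling-2.6-flash.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool, for illustration only
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="Ling-2.6-flash",  # assumed served-model name
    messages=[{"role": "user", "content": "What's the weather in Hangzhou?"}],
    tools=tools,
)
print(response.choices[0].message.tool_calls)
```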
The model is now available on mainstream open-source communities such as Hugging Face and ModelScope. With this open-source release, Ant Group hopes to empower more developers in vertical domains to explore new frontiers of large-model applications while ensuring data privacy.
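For completeness, fetching the weights from the Hugging Face Hub would look roughly like the sketch below; again, the repo id is an assumed placeholder, and ModelScope users would use that platform's equivalent download utility.

```python
# Sketch: downloading the full model snapshot from the Hugging Face Hub.
# The repo id is a hypothetical placeholder; see the official org page
# for the real path. ModelScope offers an analogous snapshot_download.
from huggingface_hub import snapshot_download

local_dir = snapshot_download("inclusionAI/Ling-2.6-flash")
print("Model files downloaded to:", local_dir)
```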
