On April 29, iFLYTEK officially launched its new Spark X2-Flash model and opened its API at the same time, marking a new stage of efficiency for large-model applications built on the domestic computing ecosystem.
The model adopts the now-mainstream MoE (Mixture of Experts) architecture with 30B total parameters, and its most notable feature is support for an ultra-long context of up to 256K tokens. Notably, Spark X2-Flash was trained entirely on Huawei's Ascend 910B cluster, demonstrating how domestic software and hardware can work together in deep-learning training.
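The core MoE idea referenced here can be illustrated with a minimal top-k routing sketch. This is a generic toy illustration of the technique, not iFLYTEK's implementation; every name, size, and the use of plain linear experts are invented for the example:

```python
import numpy as np

def moe_forward(x, gate_w, expert_ws, k=2):
    """Route a token to its top-k experts and mix their outputs.

    x: (d,) token vector; gate_w: (n_experts, d) router weights;
    expert_ws: list of (d, d) toy linear expert matrices.
    """
    logits = gate_w @ x                    # one router score per expert
    top = np.argsort(logits)[-k:]          # indices of the k highest-scoring experts
    weights = np.exp(logits[top])
    weights /= weights.sum()               # softmax over the selected experts only
    # Only k experts run per token: this is why an MoE with a large total
    # parameter count costs far less compute per token than a dense model.
    return sum(w * (expert_ws[i] @ x) for w, i in zip(weights, top))

rng = np.random.default_rng(0)
d, n_experts = 8, 4
x = rng.normal(size=d)
gate_w = rng.normal(size=(n_experts, d))
expert_ws = [rng.normal(size=(d, d)) for _ in range(n_experts)]
y = moe_forward(x, gate_w, expert_ws)
print(y.shape)  # (8,)
```

The point of the sketch is the routing step: of the model's total parameters, only the parameters of the k selected experts are exercised for any given token.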

In terms of core performance, Spark X2-Flash shows significant gains in agent and code-generation capability. According to third-party test data, its performance on complex tasks such as producing in-depth research reports, managing and invoking Skills, and executing system-control commands already matches that of the industry's top trillion-parameter models.
On the cost question developers care most about, Spark X2-Flash also performs well: in the same workflow test, its token consumption is only one-third that of today's mainstream large models, substantially lowering the barrier to building complex agent applications. For example, when creating a complex video-generation skill, the model not only understood the requirements quickly but also explained everything from the skill's structure down to its core functions.

On the technical side, Spark X2-Flash is the first to combine DSA (Sparse Attention) and MTP (Multi-Token Prediction) on domestic chips. This innovation addresses slow long-text training on domestic computing platforms, improving training efficiency 4.5-fold over clusters of the same scale. In addition, for agent reinforcement-learning scenarios, the model has more than doubled sampling and inference efficiency through combined algorithmic and engineering optimization, easing performance bottlenecks in long-interaction scenarios.
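The general idea behind sparse attention is that each token attends to a small subset of keys rather than the full context, which is what makes very long contexts tractable. A minimal sliding-window sketch of that idea follows; this is one common sparse pattern chosen for illustration, and is not a description of DSA's actual attention pattern:

```python
import numpy as np

def sliding_window_attention(q, k, v, window=4):
    """Each query attends only to the last `window` keys instead of all n,
    reducing per-token cost from O(n) to O(window) -- the core trade-off
    that sparse attention makes to handle long contexts efficiently."""
    n, d = q.shape
    out = np.zeros_like(v)
    for i in range(n):
        lo = max(0, i - window + 1)                  # start of this token's window
        scores = q[i] @ k[lo:i + 1].T / np.sqrt(d)   # scaled dot-product scores
        w = np.exp(scores - scores.max())
        w /= w.sum()                                 # softmax over the window only
        out[i] = w @ v[lo:i + 1]                     # weighted mix of windowed values
    return out

rng = np.random.default_rng(1)
n, d = 16, 8
q, k, v = (rng.normal(size=(n, d)) for _ in range(3))
out = sliding_window_attention(q, k, v)
print(out.shape)  # (16, 8)
```

Dense attention would compute an n-by-n score matrix; here each row computes at most `window` scores, which is why sparse variants scale to contexts in the hundreds of thousands of tokens.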
Currently, applications such as AstronClaw and Loomy are the first to complete integration. The model is also deeply compatible with mainstream international agent frameworks such as OpenClaw and Claude Code, giving developers worldwide a more cost-effective domestic computing solution.
