In the open-source large-model arena, the European standout Mistral AI has once again demonstrated its remarkable pace of iteration.

On March 16 local time, Mistral AI officially released Mistral Small4, the lab's first truly "all-round" large model: for the first time, flagship-level reasoning, multimodal understanding, and strong coding ability are combined in a single model. For developers, this means no longer having to choose among specialized vertical models; the new Small4 covers all of these use cases at once.


Mistral Small4 adopts an advanced MoE (Mixture of Experts) architecture:

  • Core parameters: 119B total parameters, with only 6B active per token, significantly improving inference efficiency while preserving quality.

  • Extended context: a 256k context window lets it handle entire technical documents or large codebases in one pass.

  • Flexible modes: Supports both a fast-response mode and a deep-reasoning mode, and the weights are open-sourced under the permissive Apache 2.0 license.
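The efficiency figures above come from the sparse nature of MoE: each token is routed to only a few experts, so only a fraction of the total parameters (here, 6B of 119B) participate in any single forward pass. Mistral has not published Small4's exact routing scheme, so the following is only an illustrative sketch of generic top-k expert routing:

```python
import numpy as np

def moe_forward(x, gate_w, experts, top_k=2):
    """Route each token to its top_k experts and mix their outputs.

    x:       (seq_len, d_model) token activations
    gate_w:  (d_model, n_experts) router weights
    experts: list of callables, one small feed-forward network per expert
    """
    logits = x @ gate_w                            # (seq_len, n_experts)
    top = np.argsort(logits, axis=-1)[:, -top_k:]  # top_k expert indices per token
    out = np.zeros_like(x)
    for t in range(x.shape[0]):
        # softmax over only the selected experts' logits
        sel = logits[t, top[t]]
        w = np.exp(sel - sel.max())
        w /= w.sum()
        # each token touches just top_k experts, not all of them
        for weight, e in zip(w, top[t]):
            out[t] += weight * experts[e](x[t])
    return out

# Toy demo: 4 experts, each a random linear map (purely illustrative sizes)
rng = np.random.default_rng(0)
d, n_exp = 8, 4
experts = [(lambda W: (lambda v: v @ W))(rng.normal(size=(d, d)))
           for _ in range(n_exp)]
gate_w = rng.normal(size=(d, n_exp))
x = rng.normal(size=(5, d))
y = moe_forward(x, gate_w, experts)
print(y.shape)  # (5, 8)
```

With top_k=2 out of 4 experts, each token exercises only half the expert parameters; scale the same idea up and a 119B-parameter model can run with a 6B-parameter active footprint per token.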

In terms of performance, Mistral Small4 is a major step up from its predecessor. Official figures show that in latency-optimized mode, end-to-end completion time drops by 40%, while in throughput-optimized mode it serves three times as many requests per second as Small3. Compared against external models, its results on three core benchmarks are on par with OpenAI's GPT-OSS120B.

Deployment requirements and hardware recommendations:

To get the most out of the model, Mistral AI provides clear hardware guidance: the minimum configuration is 4× HGX H100 or 1× DGX B200, while for the best experience the company recommends 4× HGX H200 or 2× DGX B200.
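A rough back-of-the-envelope check helps interpret those numbers (my assumptions, not official figures: an HGX H100 node carries 8 GPUs with 80 GB of HBM each, and 16-bit weights cost 2 bytes per parameter, so 119B parameters occupy about 238 GB before any KV cache or runtime buffers):

```python
def min_nodes(total_params_b, bytes_per_param, gpu_mem_gb, gpus_per_node,
              overhead=1.5):
    """Estimate how many multi-GPU nodes are needed just to hold the weights.

    `overhead` multiplies the raw weight size to leave room for activations
    and runtime buffers; 1.5x is a rough assumption, not a rule.
    """
    weights_gb = total_params_b * bytes_per_param   # params in billions -> GB
    needed_gb = weights_gb * overhead
    node_mem_gb = gpu_mem_gb * gpus_per_node
    return int(-(-needed_gb // node_mem_gb))        # ceiling division

# Mistral Small4: 119B params, BF16 weights, HGX H100 node (8 x 80 GB)
print(min_nodes(119, 2, 80, 8))  # -> 1
```

By this crude estimate the weights alone fit on a single HGX H100 node, so the larger official minimum of four nodes presumably budgets for the KV cache of the 256k context window and for serving throughput, both of which this sketch deliberately ignores.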

With the release of Mistral Small4, Mistral AI