On the path of bringing large models into vertical, specialized fields, Meituan has just delivered a remarkable result that has drawn attention from both academia and industry.

On March 21, Meituan officially open-sourced a large-scale mathematical proof model named LongCat-Flash-Prover. The 567.7-billion-parameter model employs a Mixture-of-Experts (MoE) architecture and has been deeply optimized for extremely complex formal mathematical proof problems.


In top-tier benchmarks of logical-reasoning capability, LongCat-Flash-Prover has demonstrated commanding performance:

Breaking Records: Achieved a score of 97.1% on the MiniF2F-Test, requiring only 72 reasoning attempts.

Overcoming Challenges: Successfully solved 41.5% of the problems on PutnamBench. Both figures set new global state-of-the-art (SOTA) records.
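A budget of "72 reasoning attempts" is usually spent by sampling candidate proofs until one passes the formal verifier. The sketch below illustrates that loop; `generate_proof` and `verifier_accepts` are hypothetical stand-ins, not LongCat-Flash-Prover's actual interface.

```python
# Hedged sketch of a pass@k attempt budget: sample candidate proofs
# until the verifier accepts one, or the budget is exhausted.

def generate_proof(problem: str, seed: int) -> str:
    # Placeholder for sampling one candidate proof from the model.
    return f"proof-{seed}-for-{problem}"

def verifier_accepts(proof: str) -> bool:
    # Placeholder for a formal checker; here, only seed 5's output passes.
    return proof.startswith("proof-5-")

def solve_with_budget(problem: str, budget: int = 72):
    """Return (attempts_used, proof), or (budget, None) if all attempts fail."""
    for seed in range(budget):
        candidate = generate_proof(problem, seed)
        if verifier_accepts(candidate):
            return seed + 1, candidate
    return budget, None

attempts, proof = solve_with_budget("minif2f-problem-1")
```

Because acceptance is decided by a mechanical checker rather than a judge model, a larger budget only increases coverage; it can never admit an incorrect proof.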

To truly give large models the rigor of a "mathematician," Meituan has achieved several key technical breakthroughs:

Eliminating Hallucinations: Introduced a strict multi-stage verification pipeline based on the AST (Abstract Syntax Tree) and integrated the Lean 4 formal language, so that every step of a proof is machine-checked, ruling out logical "hallucinations" at the root.
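Lean 4 makes this possible because its kernel type-checks every proof step: a statement is only accepted if the proof term actually establishes it. A toy example (standard Lean 4, not from the model's training data):

```lean
-- A trivial Lean 4 theorem: the kernel verifies each step, so an
-- accepted proof cannot contain an unjustified leap of logic.
theorem add_comm_example (a b : Nat) : a + b = b + a := by
  exact Nat.add_comm a b
```

If the model emitted a proof with any gap or invalid inference, Lean's checker would reject it outright, which is what makes formal proving a hallucination-free target for language models.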

Training Algorithm Evolution: To address the chronic instability of MoE models on long-horizon tasks, Meituan introduced its self-developed HisPO algorithm, combined with a theorem-consistency detection mechanism that effectively prevents reward hacking ("cheating") during the reinforcement-learning phase.

Efficient Architecture: The massive total parameter count gives the model a deep knowledge base, while the MoE architecture keeps inference flexible and efficient by activating only a subset of experts per token.
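The reason MoE decouples total parameters from per-token compute is top-k routing: a small router scores all experts, but only the k highest-scoring experts run for each token. A self-contained sketch with toy sizes (the expert count, k, and dimensions are illustrative, not LongCat-Flash's actual configuration):

```python
# Hedged sketch of MoE top-k routing: only TOP_K of NUM_EXPERTS run per
# token, so compute stays small even when total parameters are huge.
import math
import random

random.seed(0)
NUM_EXPERTS, TOP_K, DIM = 8, 2, 4

# Each "expert" is just a scaling function here to keep the sketch tiny.
experts = [lambda x, s=s: [v * (1.0 + 0.1 * s) for v in x]
           for s in range(NUM_EXPERTS)]
# Router weight matrix: DIM x NUM_EXPERTS.
router = [[random.gauss(0, 1) for _ in range(NUM_EXPERTS)]
          for _ in range(DIM)]

def moe_forward(x):
    # Router logits: one score per expert for this token.
    logits = [sum(x[d] * router[d][e] for d in range(DIM))
              for e in range(NUM_EXPERTS)]
    # Select the top-k experts and softmax their logits into mix weights.
    top = sorted(range(NUM_EXPERTS), key=lambda e: logits[e])[-TOP_K:]
    z = max(logits[e] for e in top)
    w = [math.exp(logits[e] - z) for e in top]
    total = sum(w)
    # Weighted sum over only the k selected experts' outputs.
    out = [0.0] * DIM
    for weight, e in zip(w, top):
        y = experts[e](x)
        out = [o + (weight / total) * v for o, v in zip(out, y)]
    return out, top

out, active = moe_forward([1.0, 0.5, -0.2, 0.3])
```

Only `TOP_K` expert evaluations happen per token regardless of `NUM_EXPERTS`, which is why a model with hundreds of billions of total parameters can still serve inference at the cost of a much smaller dense model.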

Currently, Meituan has fully open-sourced this model and its code on GitHub and Hugging Face platforms.

With the launch of LongCat-Flash-Prover, Meituan has taken a solid step toward bringing large models into vertical, specialized fields such as formal mathematics.