Meituan Launches LongCat-Flash-Thinking-2601, Raising the Bar for Open-Source Tool Use

Today, Meituan's LongCat team officially released and open-sourced its latest AI model, LongCat-Flash-Thinking-2601. An upgrade to the LongCat-Flash-Thinking series, the model reaches state-of-the-art (SOTA) results on core benchmarks covering agentic search, tool calling, and reasoning.

The core strength of LongCat-Flash-Thinking-2601 is its tool calling capability. It lets the model perform well on complex tasks that depend on tools and significantly reduces the training cost of adapting to new tools in real-world scenarios. The release also makes the model's "rethinking mode" available for the first time, open-sourced and free to try online at https://longcat.ai. In this mode, the model simulates human deep thinking by splitting the process into two stages, parallel thinking and summarization, with the aim of covering a problem thoroughly and reaching reliable decisions.
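The two-stage flow described above can be illustrated with a minimal Python sketch. It is not LongCat's implementation: the names `parallel_think`, `summarize`, and `rethink` are hypothetical, each "thinker" is a stand-in function for one independent reasoning path, and the majority vote used in the summarization stage is an assumption chosen only to keep the example short.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# Hypothetical stand-in for one independent reasoning path.
Thinker = Callable[[str], str]


def parallel_think(question: str, thinkers: List[Thinker]) -> List[str]:
    """Stage 1: run every reasoning path concurrently and collect the candidates."""
    with ThreadPoolExecutor(max_workers=len(thinkers)) as pool:
        # pool.map returns results in the same order as `thinkers`.
        return list(pool.map(lambda think: think(question), thinkers))


def summarize(candidates: List[str]) -> str:
    """Stage 2: merge the candidates. Majority vote is an assumption, not the model's method."""
    return Counter(candidates).most_common(1)[0][0]


def rethink(question: str, thinkers: List[Thinker]) -> str:
    """Parallel thinking followed by summarization."""
    return summarize(parallel_think(question, thinkers))
```

In a real system, each thinker would be a separate model sampling pass, and the summarization stage would more plausibly be another model call that weighs the candidates rather than a simple vote.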

In evaluation, LongCat-Flash-Thinking-2601 performed strongly across programming, mathematical reasoning, agentic tool calling, and search. In programming, the model scored 82.8 points on the LCB evaluation, placing it among the top models in its class. In mathematical reasoning, it achieved a perfect score of 100 points on the AIME-25 evaluation.

To evaluate the model's generalization ability, the LongCat team also proposed a new evaluation method. It uses an automated task synthesis pipeline that lets users randomly generate complex tasks from keywords and then measure the model's performance on them. According to the team's experiments, LongCat-Flash-Thinking-2601 stays ahead across multiple randomly generated tasks, which indicates strong generalization.
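As a rough illustration of keyword-driven task synthesis and scoring, here is a minimal Python sketch. The article gives no details of the team's pipeline, so everything here is an assumption: the `Task` structure, the `synthesize_tasks` and `evaluate` helpers, and the pass criterion (the agent must call every tool the task requires) are all hypothetical.

```python
import random
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Task:
    prompt: str
    required_tools: List[str]


def synthesize_tasks(keywords: List[str], n: int, seed: int) -> List[Task]:
    """Randomly combine keywords into n tasks; a fixed seed makes runs repeatable."""
    rng = random.Random(seed)
    tasks = []
    for _ in range(n):
        # Each task needs at least one keyword, so required_tools is never empty.
        chosen = rng.sample(keywords, rng.randint(1, len(keywords)))
        prompt = "Complete a task involving: " + ", ".join(chosen)
        tasks.append(Task(prompt=prompt, required_tools=chosen))
    return tasks


def evaluate(agent: Callable[[Task], List[str]], tasks: List[Task]) -> float:
    """Return the fraction of tasks where the agent called every required tool."""
    passed = sum(1 for t in tasks if set(t.required_tools) <= set(agent(t)))
    return passed / len(tasks)
```

Here `agent` stands in for the model under test and returns the list of tools it called for a given task. Seeding the generator keeps the "random" tasks reproducible, so different models can be compared on the same set.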

For training, the LongCat team adopted a strategy of "environment expansion + multi-environment reinforcement learning," giving the model diverse, high-intensity training environments and significantly improving its adaptability in complex scenarios. The team also injected noise into the training data to make the model more robust, so that it can still complete tasks efficiently when it runs into problems such as API call failures or missing data.
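One way to picture this kind of noise injection is a wrapper that makes a tool fail or return incomplete data at random. The Python sketch below is an assumption for illustration, not the team's training code: `make_noisy_tool`, `ToolCallError`, and the two rate parameters are hypothetical names.

```python
import random
from typing import Any, Callable, Dict


class ToolCallError(RuntimeError):
    """Raised to simulate a failed API call."""


def make_noisy_tool(
    tool: Callable[..., Dict[str, Any]],
    failure_rate: float,
    drop_rate: float,
    rng: random.Random,
) -> Callable[..., Dict[str, Any]]:
    """Wrap a tool so that calls sometimes fail and result fields sometimes go missing."""

    def noisy(**kwargs: Any) -> Dict[str, Any]:
        # Simulated API failure with probability failure_rate.
        if rng.random() < failure_rate:
            raise ToolCallError("simulated API failure")
        result = tool(**kwargs)
        # Simulated missing data: drop each field with probability drop_rate.
        return {k: v for k, v in result.items() if rng.random() >= drop_rate}

    return noisy
```

An agent trained against wrapped tools like this has to learn to retry, fall back, or work with partial results instead of assuming that every call succeeds.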

To lower the barrier for developers, Meituan's LongCat team has also released the model's weights and inference code and opened up an online demo, inviting developers to take part in the open-source project. Resources are available on platforms such as GitHub, Hugging Face, and ModelScope, and the model can be tried online at https://longcat.ai.

ChatGPT Becomes More Mature: OpenAI Launches Age Detection System to Strictly Prevent Minors from Accessing Inappropriate Content

OpenAI has introduced an "Age Prediction" feature in the paid version of ChatGPT, aiming to identify users under 18 and apply targeted protections. Rather than relying on a self-reported age, the model infers it from behavioral signals such as account age, activity times, and long-term interaction patterns.

Strategic Investment of 100 Million Yuan! China Ruyi Teams Up with Aisi Technology to Embark on a New Era of AI Real-Time Interactive Imaging

AI video company Aisi Technology has entered into a strategic partnership with China Ruyi, a company listed on the Hong Kong Stock Exchange, and secured a strategic investment of 100 million yuan (about $14.2 million). The two companies will collaborate in areas such as film and television visual design, visual effects production, intelligent generation of promotional materials, and optimization of streaming media assets. China Ruyi will also open up its copyright resources to help Aisi Technology develop the potential of IP-based creation.

OpenAI's Mental Health Safety Lead Joins Anthropic AI, Sparking Attention to Dialogue System Safety

Andrea Vallone, OpenAI's lead for mental health safety, has left the company to join Anthropic, drawing industry attention. Her previous research focused on how AI interacts with users' emotions, particularly how it should respond when users are facing mental health issues. The move highlights the growing importance of AI ethics and mental health in dialogue systems.

2.6B Parameters Outperform Hundred-Billion-Parameter Giants! Liquid AI Releases New Experimental Model LFM2-2.6B-Exp

On Christmas Day, edge AI startup Liquid AI released the open-source model LFM2-2.6B-Exp, which has only 2.6 billion parameters but performed exceptionally well on multiple benchmarks. Its instruction-following capability even surpassed that of DeepSeek R1-0528, a model with hundreds of billions of parameters, earning it the title "the strongest 3B model." The model is built on the second-generation LFM2 foundation model and was trained further with pure reinforcement learning.
