32B Inference Performance Surpasses o1-mini! Alibaba Tongyi Launches FIPO Algorithm to Make Large Models Think Deeper

According to reports, the Qwen Pilot team from Ali Tongyi Lab has introduced a new algorithm called FIPO. This algorithm aims to break through the bottlenecks of traditional reinforcement learning (RL) in handling complex logic, achieving a dual breakthrough in reasoning length and accuracy.

Core Breakthrough: Solving "Reasoning Length Stagnation"

Traditional models often struggle to distinguish which Tokens are key to reaching the correct answer when dealing with complex problems like mathematics. FIPO addresses this issue with targeted reengineering:

Future-KL Mechanism: Introduces the Future-KL strategy, specifically rewarding Tokens that have a significant positive impact on subsequent reasoning, enabling AI to "think ahead."

Symbolic Log Probability Difference: Introduces this new mechanism to precisely capture the model's optimization direction, preventing the reasoning process from getting stuck in unproductive loops.

Reasoning Length Leap: On a base model, FIPO successfully increased the average reasoning length to over 10,000 Tokens, completely solving the problem of insufficient reasoning depth.

Outstanding Performance: 32B Model Surpasses o1-mini

In practical tests, the 32B-scale model equipped with the FIPO algorithm demonstrated remarkable "powerful" performance:

Surpassing Competitors: In a pure reinforcement learning setup, its reasoning performance not only surpassed models of the same scale but also outperformed OpenAI's o1-mini in some metrics.

Mathematical Potential: The algorithm performs exceptionally well in handling high-level mathematical reasoning tasks, showcasing strong logical deduction capabilities.

Industry Context: Tongyi Lab's "Intelligent Evolution"

Ali Tongyi Lab has been active in AI fundamental algorithms recently. In addition to this impressive FIPO algorithm, the team launched the CoPaw 1.0 new version at the beginning of March, demonstrating their continuous efforts in improving the logical rigor and interaction depth of models.

Conclusion: The "Second Curve" of Reasoning Efficiency

While the industry is still debating parameter scale, Ali Tongyi

Microsoft Bing Team Open Sources Harrier Multilingual Embedding Model

Microsoft Bing team open sources the word embedding model Harrier, which supports over 100 languages and performs excellently in the MTEB v2 benchmark. The model is trained on 2 billion examples and GPT-5 synthetic data, using a 32,000 token context window, with 2.7 billion parameters, significantly improving the accuracy and flexibility of multilingual tasks.

Google Search AI Overview Accuracy is Only 90%, Easily Affected by False Information

According to The New York Times, the accuracy of Google's AI Overview feature is about 90%. With Google's annual search volume exceeding 5 trillion searches, this means that millions of incorrect answers may be generated every hour, and nearly a million pieces of incorrect information per minute. An assessment by startup company Oumi showed that the accuracy of Google's Gemini model increased from 85% in October last year to 91% in February this year.

Tencent officially launches Laoxia QBotClaw: the first AI browser in China that supports free configuration of mainstream large model APIs

Tencent has launched the first AI browser in China, Laoxia "QBotClaw", upgrading the browser into an all-scenario AI assistant. Its biggest highlight is its high degree of openness, supporting users to freely configure mainstream large model APIs and breaking away from single model binding. The Mac version is now available and integrated with QQ Browser Skill, while the Windows version will be released soon, aiming to lower the entry barrier.

15 Seconds 1080P Synchronized Audio and Video! Aishi Technology PixVerse C1 Launch: High-End Model for the Film Industry Makes a Big Impact

Aishi Technology launches the PixVerse C1, a large model tailored for the film industry, aiming to reshape the film production process. The model supports the generation of up to 15-second 1080P high-definition videos, achieving a leap from single shots to automatic scene transitions. It is now available on the Web and API platforms.