Breakthrough in Edge-side Large Models! Liquid AI Open-Sources Mixture-of-Experts Model LFM2.5

Artificial intelligence startup Liquid AI today officially released and open-sourced LFM2.5-8B-A1B, a new edge-side large model. Designed for tool calling and complex instruction following on consumer-grade hardware, the model significantly improves reasoning performance on edge devices while keeping computational costs very low.

Architecturally, the model uses a sparse mixture-of-experts (MoE) design with 8.3B total parameters. Thanks to this sparsity, it activates only 1.5B parameters per generated token, allowing it to run smoothly on local devices such as smartphones and laptops.
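To illustrate why sparsity keeps per-token compute low, here is a minimal sketch of top-k expert routing in plain Python. The expert count (8), the value of k (2), the toy experts, and the function names are all hypothetical and chosen only for illustration; the article does not describe LFM2.5's actual router. The last line simply divides the two parameter counts quoted above.

```python
import math

def top_k_route(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    # Pick the k experts with the largest router logits.
    order = sorted(range(len(router_logits)),
                   key=lambda i: router_logits[i], reverse=True)
    top = order[:k]
    # Softmax over the selected logits only, so the weights sum to 1.
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

def moe_forward(x, experts, router_logits, k):
    """Weighted sum of the outputs of only the k routed experts."""
    out = 0.0
    for idx, weight in top_k_route(router_logits, k):
        out += weight * experts[idx](x)  # the other experts are never evaluated
    return out

# Hypothetical toy setup: 8 experts (expert s multiplies its input by s),
# 2 of them active per token.
experts = [lambda x, s=s: s * x for s in range(1, 9)]
logits = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -0.5, 1.0]
routes = top_k_route(logits, k=2)
y = moe_forward(1.0, experts, logits, k=2)

# Share of parameters active per token, from the figures quoted in the article.
active_share = 1.5 / 8.3  # roughly 0.18
```

Only the routed experts run for a given token, which is why the active parameter count (1.5B) can be far smaller than the total (8.3B). The active share also includes any parameters used by every token, such as attention and embeddings, so it is not simply k divided by the number of experts.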

Extended Context Window and Enhanced Reasoning Capabilities

Compared with its predecessor, LFM2.5 expands the context window from 32K to 128K tokens and increases the pre-training data volume from 12T to 38T. As a pure reasoning model, it generates an explicit reasoning chain before outputting the final answer, and its highly compressed vocabulary efficiently handles nine languages, including Chinese and Arabic.

To address problems such as repetitive loops and hallucinations in long reasoning, the development team introduced two-stage reinforcement learning (RL) during training. Preference optimization reduces the tendency of long reasoning chains to get stuck in loops, while a dedicated anti-hallucination reward mechanism lets the model actively decline to answer questions beyond its knowledge.

Powerful Edge Performance and Full Ecosystem Compatibility

On performance, LFM2.5 scores far higher than its predecessor on logical reasoning and anti-hallucination benchmarks, and it even rivals larger models in instruction following. For tool calling, the model outputs efficient Python function calls by default and can be switched to JSON format through the system prompt.
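As a rough illustration of the two tool-call styles, the sketch below converts a Python-style call string into an equivalent JSON object using only the standard library. The function name `pythonic_call_to_json`, the example tool `get_weather`, and the `name`/`arguments` JSON layout are assumptions made for this example; the article does not specify the exact output format LFM2.5 uses.

```python
import ast
import json

def pythonic_call_to_json(call_text):
    """Convert a call such as get_weather(city="Paris") into a JSON string."""
    node = ast.parse(call_text.strip(), mode="eval").body
    if not isinstance(node, ast.Call) or not isinstance(node.func, ast.Name):
        raise ValueError("expected a single plain function call")
    if node.args:
        raise ValueError("only keyword arguments are supported in this sketch")
    arguments = {}
    for kw in node.keywords:
        if kw.arg is None:
            raise ValueError("**kwargs expansion is not supported")
        # literal_eval accepts only literals, so no generated code is executed.
        arguments[kw.arg] = ast.literal_eval(kw.value)
    return json.dumps({"name": node.func.id, "arguments": arguments})

# Hypothetical model output in the Python-call style.
example = pythonic_call_to_json('get_weather(city="Paris", days=3)')
print(example)
```

Parsing with `ast` rather than `eval` means the call is inspected but never run, which matters when the text comes from a model rather than a trusted source.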

The model was supported by mainstream inference ecosystems on its release day, including llama.cpp, MLX, vLLM, and SGLang. In hardware testing, its decoding speed reached up to 253 tokens per second on the M5 Max chip and about 30 tokens per second on mobile devices, balancing privacy and efficiency for edge-side operation.

Roblox Launches AI Creation Tool Build, Supporting Text-Generated Games and 3D Scenes

Roblox has launched the AI creation tool Build and upgraded its Studio, using AI to lower the barrier to development. Users can generate editable game content from text prompts. Testing of the feature begins on July 28, deepening the platform's 'user-generated content' philosophy. The platform has 132 million daily active users. Build is a mobile-first tool that enables creation anytime, anywhere...

Google Tests Gemini Voice Customization Feature, Adds Four Adjustment Options: Speed, Energy, Formality, and Friendliness

Google is developing a voice customization feature for Gemini that lets users fine-tune the AI's communication style with four sliders: speed, energy, formality, and friendliness. The feature was discovered in beta version 17.41.12 of the Google app and is expected to move beyond the limits of preset voice options, enabling personalized voice control in the future.

1Password Teams Up with Claude to Launch an Integrated Feature: AI Can Log You Into Websites, But Passwords Remain Hidden from It

1Password has integrated with the AI assistant Claude, offering a new way to address the risks of sharing passwords: Claude can fill in login credentials and one-time verification codes on the user's behalf to complete browser tasks, while the plaintext passwords stay locked in an encrypted vault that the AI cannot read. This approach makes the AI an executor rather than an informed party, drawing a clear line between convenience and security without handing over the keys.

AI Travel Platform Fora Completes $60 Million Series D Funding, Valuation Rises to $1 Billion

AI travel platform Fora raised $60M in a Series D round at a $1B valuation. The round was led by Forerunner and Tactile Ventures with Insight Partners, bringing total funding to $138.5M. Founded in 2021, the company combines travel services with agent operations, offering booking, planning, and communication tools that lower the barrier for users to become travel agents...
