Zhihu Opens 'Zhihai Map AI' Large Model Service


Xiaomi released the MiMo-V2.5 series of large models, including MiMo-V2.5, V2.5-Pro, and accompanying TTS and ASR models, marking an upgrade from "usable" to "user-friendly." The flagship model MiMo-V2.5-Pro has reached competitive levels with top models such as Claude Opus4.6 and GPT-5.4 in terms of general intelligent agent capabilities and software engineering. Its core advantage lies in high instruction adherence and self-correction capabilities.
Qwen launches the 'Table Agent' feature, allowing users to generate, query, and edit Excel files through natural language conversations, achieving a transition from text answers to direct results. The feature covers three key aspects: zero-barrier conversion of information into tables, intelligent retrieval, and in-depth editing, simplifying traditional table processing workflows.
Tencent has released the HY-Embodied-0.5 foundational model specifically designed for robots, aiming to address the shortcomings of general visual language models in 3D spatial perception and physical interaction, and to advance large models toward the field of robot control. The series of models have been restructured in both architecture and training, and are accompanied by the release of main models such as MoT-2B.
MiniMax launched the MMX-CLI command line tool, specifically designed for AI Agent, simplifying the process of calling multi-modal models. The tool solves issues such as complicated interface adaptation and code redundancy, allowing Agents to easily schedule various AI capabilities as if they were native applications. Users can invoke programming, video generation, and other functions with one click in mainstream development environments, without the need to write additional MCP Servers or adapt complex interfaces.
OpenAI officially adapts to Apple CarPlay, allowing iPhone users to interact with ChatGPT through voice on the car's center display. To ensure driving safety, ChatGPT on CarPlay only supports full voice interaction and prohibits displaying text or images.