The embodied intelligence track has reached a milestone. According to the latest announcement from the Shanghai Cyberspace Administration, the WITA (Silicon Photonic Language) large model from Zhiyuan has completed its filing, making it the first embodied intelligence interaction large model in the country to be deployed in compliance with regulations.

This is not just about "getting a license." The core mission of WITA is to make humanoid robots truly "able to chat, understand emotions, and have personality." It focuses on robot interaction scenarios, using natural, human-like emotional expression to turn cold mechanical bodies into "silicon companions" with continuous memory and personal traits. As the core engine of the "interactive intelligent deployment state," the model has already been applied in commercial scenarios such as guidance, shopping assistance, and service retail, addressing the industry pain point that robots can work but cannot communicate.

More excitingly, Zhiyuan revealed that WITA Omni 1.0, the first end-to-end multimodal interaction large model for robots, will be launched in the third quarter of this year. Its breakthroughs are tangible: interaction latency is compressed to within 500 ms, close to the rhythm of real-time human conversation. The model supports continuous conversation at normal speaking speed, allows interruptions and corrections, and delivers real-time emotional and tonal responses, making the interaction truly "like talking to a person."

Technically, the new model coordinates language, voice, expression, and movement across modalities, eliminating the earlier sense of disconnect in which the "mouth moves but the body does not." More importantly, through a multimodal interaction data flywheel mechanism, the model can keep learning in real scenarios and become smarter over time, forming a positive cycle of self-optimization.

The announcement is also strategically significant. At the first Hong Kong Embodied Intelligence Industry Summit, Peng Zhihui, co-founder, president, and CTO of Zhiyuan, officially announced the "Zhiyuan 358 Vision Plan," which aims to exceed 10 billion yuan in revenue by 2027 and reach 100 billion yuan by 2030. This ambitious roadmap not only demonstrates Zhiyuan's confidence in the commercialization of embodied intelligence, but also reflects that the entire sector is accelerating from technological verification to large-scale monetization, marking a critical turning point.