Xiaomi has officially released its self-developed large-scale speech synthesis model, MiMo-V2-TTS, marking significant progress in highly controllable, expressive speech generation. The model is built on Xiaomi's in-house Audio Tokenizer and a multi-codebook joint speech-text modeling architecture. Pre-trained on hundreds of millions of hours of speech data, it offers precise control over everything from macro-level style down to micro-level emotional detail.

Unlike traditional TTS systems, MiMo-V2-TTS can shift tone and emotion within a single sentence, faithfully reproducing the natural rhythm of human speech, and it supports song synthesis with accurate pitch and rhythm. On the technical side, Xiaomi introduced multi-dimensional reinforcement learning to balance the stability and expressiveness of the generated audio. The model intelligently recognizes textual cues such as punctuation, modal particles, and emphasis markers, converting them into appropriate vocal expression without additional manual annotation. It also shows strong cross-regional adaptability, supporting dialects including Northeastern Mandarin, Sichuanese, Henanese, Cantonese, and Taiwanese-accented Mandarin, and can deliver in-character voice performances.
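Xiaomi has not published MiMo-V2-TTS's internals, but "multi-codebook" audio tokenizers in the research literature are typically built on residual vector quantization (RVQ), where each codebook quantizes the residual left over by the previous one. The minimal sketch below illustrates that general idea; the codebook count, codebook size, and embedding dimension are illustrative assumptions, not the model's actual configuration.

```python
import torch

# Hypothetical parameters -- MiMo-V2-TTS's real configuration is unpublished.
# This is a generic residual vector quantization (RVQ) sketch, the standard
# way "multi-codebook" audio tokenizers discretize speech frames.
NUM_CODEBOOKS = 4      # number of residual codebooks (assumed)
CODEBOOK_SIZE = 1024   # entries per codebook (assumed)
DIM = 256              # frame embedding dimension (assumed)

torch.manual_seed(0)
# Random codebooks stand in for learned ones.
codebooks = [torch.randn(CODEBOOK_SIZE, DIM) for _ in range(NUM_CODEBOOKS)]

def rvq_encode(frames: torch.Tensor) -> torch.Tensor:
    """Quantize (T, DIM) frame embeddings into (T, NUM_CODEBOOKS) token ids.

    Each codebook quantizes the residual left by the previous one, so
    later codebooks capture progressively finer acoustic detail.
    """
    residual = frames
    ids = []
    for cb in codebooks:
        # Nearest codebook entry for every frame (L2 distance).
        dists = torch.cdist(residual, cb)   # (T, CODEBOOK_SIZE)
        idx = dists.argmin(dim=-1)          # (T,)
        ids.append(idx)
        residual = residual - cb[idx]       # pass the residual onward
    return torch.stack(ids, dim=-1)         # (T, NUM_CODEBOOKS)

frames = torch.randn(50, DIM)               # 50 dummy speech frames
tokens = rvq_encode(frames)
print(tokens.shape)                          # torch.Size([50, 4])
```

In a scheme like this, the coarse first codebook carries broad prosodic and stylistic information while later codebooks add fine acoustic detail, which fits the claim of control ranging from macro-level style to micro-level emotional nuance.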
As a key milestone on Xiaomi's voice technology roadmap, MiMo-V2-TTS will further expand its multilingual coverage and integrate deeply with the multimodal understanding capabilities of MiMo-V2-Omni. This evolution from standalone speech synthesis toward coordinated multimodal perception and expression signals that AI agents are moving beyond simple semantic interaction to more personable, emotionally resonant human-computer interaction, significantly improving the user experience in scenarios such as smart cockpits and smart homes.

