At the 2025 iFLYTEK 1024 Developer Festival, iFLYTEK officially launched the AI Software-Hardware Integrated Solution, achieving accurate recognition and understanding in complex environments such as high noise and long-distance scenarios through the deep integration of AI algorithms and hardware architecture. This breakthrough is considered an important advancement in the field of audio-visual intelligence integration.

iFLYTEK stated that traditional AI speech recognition systems often face a drop in accuracy in noisy environments. To address this, iFLYTEK has made systematic innovations in software-hardware integration, enabling AI not only to "hear clearly" but also to "understand."

iFLYTEK (2)

Based on this solution, several iFLYTEK AI hardware products have significantly improved noise reduction and recognition performance:

  • iFLYTEK Smart Office Book X5 features the industry's first "4 above and 4 below" eight-microphone array, achieving recognition effects that are far superior to iPhone 17 Pro in long-distance, high-noise environments;

  • iFLYTEK AI Translation Earphones achieve an identification accuracy of 97.1% in complex environments such as subways and exhibitions;

  • iFLYTEK Dual-Screen Translation Machine 2.0 still achieves a voice recognition accuracy of 98.69% in environments with 90dB factory noise.

iFLYTEK said these achievements are due to its continuous accumulation in speech enhancement, sound source localization, echo cancellation, and multimodal perception algorithms.

At this developer festival, iFLYTEK also released the "Custom Voice Replication" technology based on the Spark Speech Large Model. Users can replicate any voice with just one recording and generate different styles of voices with a single instruction.

This technology marks the popularization stage of personalized voice creation. It can be widely applied in fields such as digital humans, audiobooks, film and television dubbing, and content creation, allowing everyone to quickly create their own "AI voice avatar."