Zhipu AI has officially launched its latest GLM-ASR series speech recognition model and open-sourced the related technology, aiming to provide users with a more efficient voice interaction experience. The launch of this series also includes a desktop "Zhipu AI Input Method," offering great convenience for voice input on PC platforms.

image.png

GLM-ASR-2512 is Zhipu AI's globally leading cloud-based speech recognition model. Its main feature is real-time speech-to-text conversion, performing excellently in complex real-world environments, with a character error rate (CER) as low as 0.0717. This outstanding recognition accuracy keeps it at the forefront of the industry in multi-scenario, multi-language, and multi-accent applications.

In addition to GLM-ASR-2512, Zhipu AI has also open-sourced GLM-ASR-Nano-2512. This model has only 1.5B parameters but performs SOTA in the open-source speech recognition field, even surpassing some closed-source models in certain tests. The design of GLM-ASR-Nano-2512 allows it to run locally, ensuring high-precision speech recognition while enhancing user privacy and reducing interaction latency.

Based on the powerful capabilities of these two models, Zhipu AI has launched the new Zhipu AI Input Method. Users can not only achieve accurate speech-to-text functionality through this input method but also perform intelligent operations such as translation and text rewriting, truly realizing the convenient experience of "指尖即模型,语音即指令" (the fingertip is the model, and voice is the command). Currently, the Zhipu AI Input Method is available to all users, and new users can get 2000 points, enjoying up to 28 days of free usage time.

GLM-ASR-Nano-2512: Hugging Face: https://huggingface.co/zai-org/GLM-ASR-Nano-2512

Zhipu AI Input Method: https://autoglm.zhipuai.cn/autotyper/

Key Points:

🌟 Launch of the GLM-ASR series model, including a globally leading cloud-based speech recognition model and an edge-side model, with excellent recognition accuracy.  

🛠️ Launch of the new Zhipu AI Input Method, supporting speech-to-text, translation, and rewriting, providing a convenient PC-based voice interaction experience.  

🎁 New users can get 2000 points for free, enjoying up to 28 days of usage, encouraging more users to experience the smart input method.