Efficient and Lightweight: IBM Launches Granite 4.0 1B Speech Multimodal Speech Large Model

IBM has officially launched Granite4.01B Speech. This is a compact speech language model designed for edge computing and enterprise deployment, aiming to provide high-efficiency multilingual automatic speech recognition (ASR) and bidirectional automatic speech translation (AST) capabilities.

Compared to the previous version, Granite4.01B Speech has half the parameters of the previous model, yet it achieves significant performance improvements. The new model not only adds support for Japanese ASR, but also introduces a keyword bias feature and greatly improves the accuracy of English transcription. Its core design goal is to significantly reduce memory usage, inference latency, and computational costs without sacrificing core capabilities.

The model uses an innovative "two-stage design" architecture. The system first converts audio into text, and then processes it through a dedicated Granite language model. This modular design allows developers to flexibly arrange the process according to their needs. Currently, the model supports multilingual translation including English, French, German, Spanish, Portuguese, and Japanese, and can handle translation tasks from English to Chinese (Mandarin).

In performance testing, Granite4.01B Speech performed excellently, ranking first on the OpenASR leaderboard, with an average word error rate (WER) of just 5.52. Currently, IBM has officially open-sourced the model under the Apache 2.0 license, and developers can deploy it locally using mainstream frameworks such as Transformers or vLLM, providing strong AI voice support for resource-constrained mobile or edge devices.

Project: https://huggingface.co/ibm-granite/granite-4.0-1b-speech

New Breakthrough in Medical AI: Baichuan Intelligence Releases M4 Model, Achieving Doctor-like Active Diagnosis

Baichuan Intelligence and the Tsinghua team released the medical large model Baichuan-M4, securing first place in three sub-leaderboards in the authoritative HealthBench evaluation, outperforming GPT-5.5. Its core breakthrough lies in completely transforming the interaction mode, achieving more clinically realistic intelligent diagnostic capabilities.

Amazon's Double Standard: Advertising on ChatGPT to Drive Traffic, but Strictly Preventing AI from Scraping Data

Amazon shows a double standard in the AI retail wave: on one hand, it is investing heavily in advertising on ChatGPT to attract its large user base; on the other hand, it strictly prevents other AI systems from scraping data from its product pages. According to analysts, it has joined OpenAI's ad system and is the most active giant in the retail industry. When users ask for shopping advice on ChatGPT, the system will prioritize displaying Amazon's sponsored products.

OpenAI Launches the 'Patch the Planet' Initiative: Collaborating with Security Experts to Address Vulnerabilities in the Open Source World

OpenAI launched the 'Patch the Planet' initiative, using AI technology to help the open source community automatically identify and fix code security vulnerabilities, addressing issues such as weak oversight in the open source ecosystem. The name of the initiative pays homage to a classic movie line, aiming to strengthen the security foundation of global digital infrastructure.

Anthropic Joins Frontier Alliance, Becomes the First AI Startup to Focus on Carbon Removal

Anthropic becomes the first pure AI company to join the carbon removal alliance Frontier, investing 915 million U.S. dollars, nearly doubling the alliance's total commitments to 1.8 billion U.S. dollars. Frontier has signed nearly 700 million U.S. dollars in contracts, covering more than 50 projects including direct air capture, with plans to remove 1.8 million tons of carbon. This deal also marks Anthropic's first climate-related investment.

Efficient and Lightweight: IBM Launches Granite 4.0 1B Speech Multimodal Speech Large Model

Related Recommendations

New Breakthrough in Medical AI: Baichuan Intelligence Releases M4 Model, Achieving Doctor-like Active Diagnosis

Amazon's Double Standard: Advertising on ChatGPT to Drive Traffic, but Strictly Preventing AI from Scraping Data

OpenAI Launches the 'Patch the Planet' Initiative: Collaborating with Security Experts to Address Vulnerabilities in the Open Source World

Major Upgrade in Voice Interaction: Claude is Developing Multilingual Support, Bringing a Phone-Call Experience Closer

Anthropic Joins Frontier Alliance, Becomes the First AI Startup to Focus on Carbon Removal