IBM Launches Granite 4.0 Nano Series: Small Open-Source Models Designed for Edge AI

Published in Latest AI News

Time: Oct 30, 2025

Read: 4 minutes

Recently, the IBM AI team launched the Granite 4.0 Nano series, a family of small models designed for local and edge inference that aims to give enterprises stronger control under open-source licensing. The series includes 8 models in two sizes, 350M and approximately 1B parameters, offered in hybrid SSM (state space model) and pure transformer architectures and in both base and instruct variants. All models are released under the Apache 2.0 license and run natively in popular runtimes such as vLLM, llama.cpp, and MLX.

The Granite 4.0 Nano series comprises four model lines and their corresponding base versions. Granite 4.0 H 1B uses a hybrid SSM architecture with approximately 1.5B parameters, and Granite 4.0 H 350M uses the same hybrid approach with 350M parameters. For maximum runtime compatibility, IBM also provides pure transformer versions: Granite 4.0 1B and Granite 4.0 350M.
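As a rough back-of-the-envelope illustration of why models of this size suit edge hardware, the sketch below estimates weight-only memory from the parameter counts quoted above. The bytes-per-parameter figures are generic assumptions rather than IBM-published numbers, the 1.0e9 count for the "approximately 1B" transformer model is a rounding assumption, and the estimate ignores activations, caches, and runtime overhead.

```python
# Parameter counts as quoted in the article; "1B" is treated as a
# nominal 1.0e9 (assumption, the article only says "approximately 1B").
PARAM_COUNTS = {
    "Granite 4.0 H 1B": 1.5e9,
    "Granite 4.0 H 350M": 350e6,
    "Granite 4.0 1B": 1.0e9,
    "Granite 4.0 350M": 350e6,
}

# Generic storage cost per weight (assumption). Real quantized formats
# add some overhead for scales and metadata.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(n_params: float, precision: str) -> float:
    """Return the approximate weight-only footprint in GiB."""
    return n_params * BYTES_PER_PARAM[precision] / 2**30


if __name__ == "__main__":
    for name, n_params in PARAM_COUNTS.items():
        row = ", ".join(
            f"{precision}: {weight_memory_gib(n_params, precision):.2f} GiB"
            for precision in BYTES_PER_PARAM
        )
        print(f"{name:<20} {row}")
```

Under these assumptions, every model in the lineup fits in a few GiB at 16-bit precision, which is in the range of laptops and many edge devices.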

The H variants interleave SSM layers with transformer layers. Compared with a pure transformer, this hybrid structure keeps memory growth much lower as context length increases, while retaining the generality of transformer blocks. The Granite 4.0 Nano models were not trained on a reduced data pipeline; they follow the same training recipe as the larger Granite 4.0 models, were trained on over 15 trillion tokens, and then underwent instruction tuning to strengthen tool use and instruction following.
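To make the memory-growth point concrete, here is a minimal sketch comparing inference-time cache memory for a pure-transformer stack and a hybrid stack as the context gets longer. All layer counts, head dimensions, and the SSM state size are hypothetical placeholders chosen for illustration, not the published Granite 4.0 Nano configuration. The sketch relies only on two general facts: an attention layer caches one key and one value vector per token, while an SSM layer keeps a fixed-size state regardless of sequence length.

```python
from dataclasses import dataclass


@dataclass
class StackConfig:
    """Hypothetical layer layout; none of these values come from IBM."""
    attn_layers: int
    ssm_layers: int
    kv_heads: int = 4                   # hypothetical
    head_dim: int = 64                  # hypothetical
    ssm_state_elems: int = 128 * 1024   # hypothetical per-layer state size
    bytes_per_elem: int = 2             # fp16


def cache_bytes(cfg: StackConfig, seq_len: int) -> int:
    """Cache memory needed to decode with `seq_len` tokens of context."""
    # Attention: keys and values (factor 2) stored for every token.
    kv = (2 * cfg.attn_layers * cfg.kv_heads * cfg.head_dim
          * seq_len * cfg.bytes_per_elem)
    # SSM: one fixed-size recurrent state per layer, independent of seq_len.
    ssm = cfg.ssm_layers * cfg.ssm_state_elems * cfg.bytes_per_elem
    return kv + ssm


PURE = StackConfig(attn_layers=32, ssm_layers=0)
HYBRID = StackConfig(attn_layers=4, ssm_layers=28)

if __name__ == "__main__":
    for seq_len in (1_024, 8_192, 65_536):
        pure_mib = cache_bytes(PURE, seq_len) / 2**20
        hybrid_mib = cache_bytes(HYBRID, seq_len) / 2**20
        print(f"{seq_len:>6} tokens: pure {pure_mib:8.1f} MiB, "
              f"hybrid {hybrid_mib:8.1f} MiB")
```

In this toy layout the hybrid stack pays a constant cost for its SSM states, but its per-token growth is only that of its few attention layers, so the gap to the pure transformer widens as the context grows.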

IBM also compared Granite 4.0 Nano with other models of similar size, including Qwen, Gemma, and LiquidAI LFM. According to IBM, the results show significant improvements in general knowledge, mathematics, code, and safety. The models also performed well on agentic tasks, as measured by IFEval and the Berkeley Function Calling Leaderboard v3 (BFCLv3).

The models are ISO 42001 certified and released with cryptographic signatures, providing the traceability and governance that enterprise use requires. They are available through Hugging Face and IBM watsonx.ai and can be deployed at the edge, on local machines, and in the browser, helping early-stage AI engineers and software teams get projects off the ground.

Hugging Face: https://huggingface.co/collections/ibm-granite/granite-40-nano-language-models

Key Points:
🔹 IBM released the Granite 4.0 Nano series, a set of 8 models built for edge AI inference.
🔹 The models were trained on over 15 trillion tokens using the same training recipe as the larger Granite 4.0 models.
🔹 All models are ISO 42001 certified, offer enterprise-grade governance, and support multiple runtimes.
