Article

Google Collaborates with NVIDIA to Release Open-Source Model DiffusionGemma: Introduces Diffusion Mechanism, Speeds Up Single-Card Inference by Four Times

Published in Latest AI News

Time :Jun 11, 2026

Read :3minute

Google officially released the experimental open-source language model DiffusionGemma on June 10, 2026, breaking the traditional autoregressive paradigm of large models that generate text word by word. It is the first to introduce the diffusion mechanism used in image AI into the field of text generation. The model can output 256 token blocks in parallel in a single step through multiple iterative optimizations starting from random noise.

Regarding hardware performance, through deep optimization by NVIDIA, the model's runtime speed under single GPU single-user mode is nearly four times faster than that of similar traditional models. When processing a single request on an H100 graphics card, its output speed can reach 1000 tokens per second. Even on high-end consumer-grade GPUs such as RTX5090, it can exceed 700 tokens per second.

DiffusionGemma has 26 billion parameters and is based on a mixture-of-experts (MoE) architecture, with only 3.8 billion parameters activated in a single step. Although its text generation quality and accuracy are slightly inferior to traditional Gemma4 series models in standard benchmark tests, its unique "full-block awareness" capability breaks the limitation of autoregressive models that can only look backward. Since all tokens can refer to each other during generation, the model shows significant advantages in tasks such as text completion, code filling, Sudoku solving, and amino acid sequence processing, which involve nonlinear and structured data.

Currently, the model weights are open-sourced on Hugging Face under the Apache 2.0 license and are fully compatible with mainstream inference frameworks such as vLLM and MLX. This exploration not only breaks the constraints of memory bandwidth on GPU computing power but also opens up a new technical path for future AI applications in complex logic and nonlinear text generation tasks.

Related Recommendations

Google Releases DiffusionGemma: Trying to Speed Up AI Inference Using Text Diffusion Architecture

Google released the open-source experimental model DiffusionGemma on June 10, using a text diffusion architecture to achieve up to 4x faster text generation on dedicated GPUs compared to traditional autoregressive models, aiming to boost AI efficiency, though the company remains cautious.....

Jun 11, 2026

173.3k

French AI Unicorn Mistral AI's Valuation Surges to $14 Billion

French AI startup Mistral AI is finalizing €2B funding at a €14B valuation, potentially becoming Europe's top tech startup. Founded by ex-DeepMind/Meta researchers, it rivals OpenAI with open-source models and European AI chatbot Le Chat.....

Sep 5, 2025

143.4k

Xiaomi Technology Team Advances in AI Programming: MiMo Code Officially Open-Source

On June 11, Xiaomi's MiMo team launched MiMo Code, an AI coding assistant built on OpenCode technology. It features persistent memory, infinite context processing, precise logic understanding, and model-agent collaboration with a unique Compose mode for efficient coding support.....

Jun 11, 2026

430.9k

Warner Music Officially Acquires Sureel AI: Building a Copyright Firewall for Musicians

Warner Music Group announced on June 10 its acquisition of AI technology company Sureel AI, shifting from a defensive to an active stance in AI. The move aims to leverage Sureel AI's technology to reshape the music industry's copyright order for its artists, songwriters, and rights holders, addressing challenges from generative AI.....

Jun 11, 2026

250.1k

Rushing to Hong Kong Stock Market: Haining Zhiyuan Aims to Become the First Physical AI Company

HaiQingZhiYuan, a national-level specialized and new 'Little Giant' enterprise focusing on multi-spectral AI, launched its IPO subscription on June 11 and plans to list on the Hong Kong Stock Exchange on June 22. The company aims to issue 85.1625 million H-shares at HK$7.2 each, sponsored by Minsheng Capital and SPDB International. Founded in 2013 in Shenzhen, it evolved from hardware sales to deep AI expertise.....

Jun 11, 2026

252.9k

Intelligent Future, Your Artificial Intelligence Solution Think Tank

English 简体中文繁體中文にほんご