Google DeepMind has launched SIMA 2, a multimodal agent built on the Gemini 2.5 Flash-Lite model. Its task success rate is roughly double that of SIMA 1, it can carry out complex instructions in environments it has never encountered before, and it can improve itself over time. The current version is released as a research preview, intended to validate the high-level world understanding and reasoning that DeepMind views as prerequisites for general-purpose robotics and AGI.

Like its predecessor, SIMA 2 is pre-trained on hundreds of hours of gameplay video, but it introduces a self-generated data loop for the first time: after entering a new scene, the system calls a separate Gemini model to propose tasks in bulk, an internal reward model scores the resulting trajectories, and the high-quality ones are selected for continued fine-tuning, improving performance without additional manual annotation. According to the research team, this mechanism lets the agent carry out commands such as "go to the red house" or "cut down trees" in test environments like "No Man's Sky" by reading in-game text, recognizing colors and symbols, and even interpreting combinations of emoji.
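
DeepMind has not published code for this pipeline, but the loop described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions: the task generator, reward model, and policy are stand-in functions with hypothetical names, not DeepMind's actual components.

```python
# Hypothetical sketch of the self-generated data loop described in the article:
# a separate model proposes tasks in a new scene, the agent attempts them, an
# internal reward model scores each trajectory, and only high-scoring
# trajectories are kept for further fine-tuning. Names are illustrative only.

import random
from dataclasses import dataclass


@dataclass
class Trajectory:
    task: str
    actions: list[str]
    score: float = 0.0


def generate_tasks(scene: str, n: int = 4) -> list[str]:
    """Stand-in for the separate Gemini model that proposes tasks in bulk."""
    templates = ["go to the red house", "cut down trees",
                 "collect resources", "follow the path"]
    return [f"{t} ({scene})" for t in random.sample(templates, k=n)]


def rollout(agent_policy, task: str) -> Trajectory:
    """Stand-in for the agent attempting a task and recording its actions."""
    actions = [agent_policy(task, step) for step in range(3)]
    return Trajectory(task=task, actions=actions)


def reward_model(traj: Trajectory) -> float:
    """Stand-in for the internal reward model scoring a trajectory."""
    return random.random()  # a real system would judge success from observations


def self_improvement_round(agent_policy, scene: str, threshold: float = 0.7) -> list[Trajectory]:
    """One round of the loop: propose tasks, roll out, score, filter."""
    kept = []
    for task in generate_tasks(scene):
        traj = rollout(agent_policy, task)
        traj.score = reward_model(traj)
        if traj.score >= threshold:  # keep only high-quality trajectories
            kept.append(traj)
    return kept  # fed back into fine-tuning, with no manual annotation


if __name__ == "__main__":
    def dummy_policy(task: str, step: int) -> str:
        return f"action_{step}"

    batch = self_improvement_round(dummy_policy, scene="new test environment")
    print(f"kept {len(batch)} trajectories for fine-tuning")
```

The key design point the article highlights is the filter: because the reward model discards low-quality attempts, the agent can keep training on its own experience without humans labeling each trajectory.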

In demonstrations, DeepMind paired SIMA 2 with the generative world model Genie, which produced realistic outdoor scenes for the agent; SIMA 2 accurately identified objects such as benches, trees, and butterflies and interacted with them. Jane Wang, a senior research scientist, said that this "understand the scene → infer the goal → plan actions" loop is precisely the high-level behavioral module needed to transfer capabilities from virtual environments to real robots.
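
For readers unfamiliar with agent architectures, the loop Wang describes can be pictured as three stages chained per step. The sketch below is purely illustrative, with hypothetical function names; the real system performs each stage with Gemini-based perception and reasoning rather than hand-written rules.

```python
# Illustrative "understand the scene -> infer the goal -> plan actions" loop.
# All functions are toy stand-ins; SIMA 2's actual perception and reasoning
# are handled by its underlying multimodal model.

from typing import Any


def understand_scene(observation: Any) -> dict:
    """Stand-in for multimodal perception of the current frame."""
    return {"objects": ["bench", "tree", "butterfly"]}


def infer_goal(instruction: str, scene: dict) -> str:
    """Stand-in for reasoning about what the instruction asks for in this scene."""
    for obj in scene["objects"]:
        if obj in instruction:
            return f"reach_{obj}"
    return "explore"


def plan_actions(goal: str) -> list[str]:
    """Stand-in for high-level planning; low-level motor control is out of scope."""
    return [f"navigate_toward:{goal}", f"interact:{goal}"]


def agent_step(observation: Any, instruction: str) -> list[str]:
    scene = understand_scene(observation)
    goal = infer_goal(instruction, scene)
    return plan_actions(goal)


if __name__ == "__main__":
    print(agent_step(observation="frame_0001.png", instruction="walk over to the bench"))
```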

For now, however, SIMA 2 is limited to high-level decision-making and does not handle low-level control such as joints or wheels. DeepMind has separately trained a robot foundation model using a different technical approach, and how the two efforts will be combined remains undecided. The team declined to give a release date, saying only that it hopes the preview will attract outside collaborators to explore how virtual agents can be transferred to physical robots.
