With the rapid development of AI technology, the quality of video generation has been improving at an astonishing pace, evolving from blurry early clips to highly realistic footage. However, the lack of control and editing capabilities for generated videos remains a critical issue. NVIDIA and its research partners have now introduced DiffusionRenderer, offering a new approach to this challenge.
DiffusionRenderer is a groundbreaking research achievement: it not only generates videos but also understands and manipulates the 3D scenes within them. By unifying generation and editing in a single model, it substantially expands what AI-driven content creation can do. Classical approaches such as physically based rendering (PBR) excel at producing high-fidelity imagery, but they cannot edit an existing video on their own, because they depend on precise descriptions of scene geometry, materials, and lighting that real footage does not come with. DiffusionRenderer handles the 3D scene in a different, neural way, breaking through this limitation.
The model uses two neural renderers. The first, a neural inverse renderer, analyzes the input video, estimates the scene's geometric and material properties, and writes them into intermediate data buffers (G-buffers); the second, a neural forward renderer, combines these buffers with the desired lighting to synthesize a high-quality, photorealistic video. Working in tandem, the two renderers give DiffusionRenderer strong adaptability when processing real-world data.
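To make that two-stage split concrete, here is a minimal, runnable sketch of the pipeline using tiny placeholder convolutional networks in place of the actual video diffusion models. The class names, buffer names, and tensor shapes are illustrative assumptions, not the real DiffusionRenderer API.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class NeuralInverseRenderer(nn.Module):
    """Toy stand-in: maps an RGB video to per-frame G-buffers (albedo, normals, roughness)."""

    def __init__(self) -> None:
        super().__init__()
        # 3 RGB channels in -> 3 (albedo) + 3 (normals) + 1 (roughness) channels out
        self.net = nn.Conv2d(3, 7, kernel_size=3, padding=1)

    def forward(self, video: torch.Tensor) -> dict:
        out = self.net(video)  # frames are treated as the batch dimension
        return {
            "albedo": out[:, 0:3].sigmoid(),
            "normals": F.normalize(out[:, 3:6], dim=1),
            "roughness": out[:, 6:7].sigmoid(),
        }


class NeuralForwardRenderer(nn.Module):
    """Toy stand-in: combines G-buffers with a target lighting map into RGB frames."""

    def __init__(self) -> None:
        super().__init__()
        # 7 G-buffer channels + 3 lighting channels -> 3 RGB channels
        self.net = nn.Conv2d(10, 3, kernel_size=3, padding=1)

    def forward(self, gbuffers: dict, lighting: torch.Tensor) -> torch.Tensor:
        x = torch.cat([gbuffers["albedo"], gbuffers["normals"],
                       gbuffers["roughness"], lighting], dim=1)
        return self.net(x).sigmoid()


# End-to-end pass: analyze the input video, then re-render it under new lighting.
video = torch.rand(8, 3, 64, 64)         # 8 frames, 64x64, RGB
new_lighting = torch.rand(8, 3, 64, 64)  # toy per-frame lighting map
inverse, forward_renderer = NeuralInverseRenderer(), NeuralForwardRenderer()
relit = forward_renderer(inverse(video), new_lighting)
print(relit.shape)  # torch.Size([8, 3, 64, 64])
```

The key design point this sketch captures is the clean hand-off: everything the forward renderer needs about the scene is packed into the G-buffers, so swapping the lighting input relights the video without touching the original footage.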
The research team designed a dedicated data strategy for DiffusionRenderer. They built a large synthetic dataset of 150,000 videos as the foundation for the model's learning, and complemented it with a real-world dataset of 10,510 videos whose scene attribute labels are generated automatically by the inverse renderer, allowing the model to adapt to the characteristics of real footage.
DiffusionRenderer's performance is impressive, showing advantages over competing methods across multiple task comparisons. It not only produces more realistic lighting in complex scenes but also estimates scene material properties more accurately during inverse rendering.
The practical application potential of this technology is vast. With DiffusionRenderer, users can relight scenes dynamically, edit materials, and insert objects seamlessly, starting from nothing more than an input video. The release of this technology marks a significant leap forward for video rendering and editing, giving creators and designers greater creative freedom.
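As a rough illustration of why this pipeline makes such edits easy, the snippet below applies a material edit and a toy object insertion directly to G-buffers shaped like those in the earlier sketch. The buffer names, the mask-based compositing, and the specific edit operations are illustrative simplifications, not the project's actual editing interface.

```python
import torch

# Toy G-buffers shaped like those in the earlier pipeline sketch (8 frames, 64x64).
gbuffers = {
    "albedo": torch.rand(8, 3, 64, 64),
    "normals": torch.nn.functional.normalize(torch.rand(8, 3, 64, 64), dim=1),
    "roughness": torch.rand(8, 1, 64, 64),
}

# Material edit: make every surface glossier by lowering its roughness.
gbuffers["roughness"] = gbuffers["roughness"] * 0.3

# Object insertion: composite an object's own buffers into the scene via a mask.
mask = torch.zeros(8, 1, 64, 64)
mask[:, :, 16:48, 16:48] = 1.0                   # toy square region for the object
object_albedo = torch.full((8, 3, 64, 64), 0.8)  # flat light-gray object color
gbuffers["albedo"] = mask * object_albedo + (1 - mask) * gbuffers["albedo"]

# The forward renderer would then re-render the edited buffers under any desired
# lighting, producing the relit, edited video.
```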
- Demo video: https://youtu.be/jvEdWKaPqkc
- GitHub: https://github.com/nv-tlabs/cosmos1-diffusion-renderer
- Project page: https://research.nvidia.com/labs/toronto-ai/DiffusionRenderer/
Key Points:
🌟 DiffusionRenderer brings new possibilities to 3D scene creation by combining generation and editing functions.
🎥 The model leverages the collaboration between the neural inverse renderer and the neural forward renderer, enhancing the realism and adaptability of video rendering.
🚀 Its practical applications include dynamic relighting, material editing, and object insertion, letting creators easily modify and recreate video scenes.