Apple Releases LiTo Large Model: Single Image Becomes 3D, Light and Shadow Restoration Accuracy Increased by 37%

Apple's AI research team recently released LiTo (Surface Light Field Tokenization), a large model for 3D generation. It addresses a long-standing challenge in 3D reconstruction: generating a complete 3D object with high-fidelity lighting effects from a single 2D image.

At the core of LiTo is a novel unified 3D latent representation:

Efficient Encoding: It compresses complex surface light field data into a compact set of vectors that describes both an object's geometry and how light interacts with its surface.
Bidirectional Mechanism: It uses an encoder-decoder architecture. The encoder extracts geometric structure and appearance features; the decoder reverses the process, accurately reproducing advanced visual effects such as specular highlights and Fresnel reflections.
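
To make the Fresnel effect mentioned above concrete, here is a minimal sketch of Schlick's approximation, a standard computer graphics formula for how reflectance rises at grazing viewing angles. It is not taken from LiTo and says nothing about how the model is implemented; the function name `schlick_fresnel` and the default `f0` value of 0.04 (a commonly used figure for non-metallic surfaces) are assumptions chosen for illustration.

```python
import math


def schlick_fresnel(cos_theta: float, f0: float = 0.04) -> float:
    """Schlick's approximation of Fresnel reflectance.

    cos_theta: cosine of the angle between the view direction and the
               surface normal (1.0 = looking straight at the surface).
    f0:        reflectance at normal incidence (illustrative default).
    """
    # Clamp so that slightly out-of-range inputs do not produce nonsense.
    cos_theta = min(max(cos_theta, 0.0), 1.0)
    return f0 + (1.0 - f0) * (1.0 - cos_theta) ** 5


if __name__ == "__main__":
    # Reflectance grows as the viewing angle moves toward grazing.
    for angle_deg in (0, 45, 80, 89):
        r = schlick_fresnel(math.cos(math.radians(angle_deg)))
        print(f"{angle_deg:>2} deg -> reflectance {r:.3f}")
```

Because the reflectance depends on the viewing angle, the same surface point looks different from different cameras, which is why a representation that stores only a fixed color per point cannot reproduce this kind of effect.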

Performance: Consistency of Lighting Across Multiple Viewpoints

To train LiTo, the research team used a 3D dataset containing thousands of objects. Experimental results show:

Resolving Directional Bias: LiTo strictly follows the camera coordinate system, solving the common issue of incorrect object orientation found in similar models.
Leading Metrics: In terms of multi-view lighting consistency, LiTo improves by approximately 37% over the current top model, TRELLIS.

This result further lowers the barrier to 3D content creation, and it is expected to support higher-quality material generation for augmented reality (AR) and spatial computing devices such as Vision Pro.
