Gemini Veo 3.1 Launches Multi-Image Reference, Synthesizes Three Elements into a Video in One Go

Google has rolled out the Veo 3.1 video model to Gemini Pro/Ultra subscription users today, introducing a new "Ingredients to Video" mode that supports uploading three reference images simultaneously, extracting character, scene, and style features, and fusing them into an 8-second 1080p video. The generated content includes an embedded SynthID invisible watermark. Users can input text prompts on the web or mobile app and generate videos with one click. The system maintains character consistency across frames and coherent lighting.

Google's demonstration shows that three different angle selfies + a cyber city background + an oil painting style image can output a short film titled "Impressionist Future Street Walk," with zero deformation in facial features and clothing. Veo 3.1 also outputs native ambient sounds, supports control of the first and last frames, and video extension features.

Google stated that the multi-image reference feature is now fully available, with generation quotas consistent with existing subscription plans, and no additional paid plans have been announced.

From Chat Assistant to Computer Manager: Gemini 3.5 Flash Opens a New Era of AI Proactive Execution

Google launched the Gemini 3.5 Flash model, with core optimization of computer operation capabilities, allowing AI to directly control the computer interface and autonomously perform complex cross-software workflows. This marks the evolution of AI from a text-based question-answering machine into an active agent, shifting toward a more delegated role.

Anthropic AI Model Breaks into U.S. Classified System in Hours, Government Immediately Locks It Down

U.S. testing showed that Anthropic's AI model Mythos detected multiple vulnerabilities in a government classified system within hours, with efficiency far exceeding traditional manual methods. Senator Warner cited the statement of General Rad from the NSA during a hearing, confirming that the tool located vulnerabilities within just a few hours, highlighting the potential of AI in the field of cybersecurity.

US AI Startup Sues Government: Cutting Off Access to Large Models Is Like Cutting Off a Livelihood

The US AI company Legion has filed a lawsuit against the federal government due to a ban. Because Anthropic, under export control regulations, has prohibited providing the most advanced AI models to foreign citizens, the Canadian development team of Legion lost their core tools, and the company claims it is facing a survival crisis, having already filed a lawsuit in the US Federal Court in Washington.

Doubao Audio Generation Model 1.0 Launched, Opening the Era of Audio Direction

Volcano Engine launched Doubao Audio Generation Model 1.0, featuring two core technologies: "Multimodal Reference Generation" and "Long-term Voice Consistency," simplifying traditional audio post-production processes. It enables one-stop generation of dialogue, sound effects, and background music, improving creative efficiency.

Gemini Veo 3.1 Launches Multi-Image Reference, Synthesizes Three Elements into a Video in One Go

Related Recommendations

Cutting Flesh Without Data? Google Forces New AI Training Rules

From Chat Assistant to Computer Manager: Gemini 3.5 Flash Opens a New Era of AI Proactive Execution

Anthropic AI Model Breaks into U.S. Classified System in Hours, Government Immediately Locks It Down

US AI Startup Sues Government: Cutting Off Access to Large Models Is Like Cutting Off a Livelihood

Doubao Audio Generation Model 1.0 Launched, Opening the Era of Audio Direction