Google today rolled out the Veo 3.1 video model to Gemini Pro and Ultra subscribers. The release introduces a new "Ingredients to Video" mode that accepts three reference images at once, extracts character, scene, and style features from them, and fuses them into an 8-second 1080p video. Generated content carries an embedded, invisible SynthID watermark. Users enter a text prompt on the web or in the mobile app and generate a video with one click, and the system keeps characters consistent across frames and lighting coherent.

In Google's demonstration, selfies taken from three different angles, a cyber-city background, and an oil-painting style image are combined into a short film titled "Impressionist Future Street Walk," with no visible distortion of facial features or clothing. Veo 3.1 also generates native ambient sound and supports first- and last-frame control as well as video extension.
Google said the multi-image reference feature is now fully available, that generation quotas follow existing subscription plans, and that it has not announced any additional paid plans.
