Google Gemini now lets users guide AI video with multiple reference images per input

Nov 16, 2025

Google is updating the Gemini app with a new way to control its AI video model. With the latest release, users can upload multiple reference images for a single video prompt. The system then generates video and audio based on those images combined with text, giving people more direct control over how the final clip looks and sounds.

Google previously tested this feature in Flow, the company's expanded video AI platform. Flow also supports extending existing clips and stitching together multiple scenes, and it offers a slightly higher video quota than the Gemini app. Veo 3.1 has been available since mid-October and, according to Google, delivers more realistic textures, higher input fidelity, and better audio quality than Veo 3.0.

AI News Without the Hype – Curated by Humans

Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.

AI news without the hype
Curated by humans.

More than 16% discount.
Read without distractions – no Google ads.
Access to comments and community discussions.
Weekly AI newsletter.
6 times a year: “AI Radar” – deep dives on key AI topics.
Up to 25 % off on KI Pro online events.
Access to our full ten-year archive.
Get the latest AI news from The Decoder.

Subscribe to The Decoder

Google Gemini now lets users guide AI video with multiple reference images per input

AI News Without the Hype – Curated by Humans

AI news without the hypeCurated by humans.

AI news without the hype
Curated by humans.