AI in practice

Mar 16, 2025Mar 16, 2025

Google's Gemini models add native video understanding

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

Google has integrated native video understanding into its Gemini models, enabling users to analyze YouTube content through Google AI Studio. Simply enter a YouTube video link into your prompt. The system then transcribes the audio and analyzes the video frames at one-second intervals. You can, for example, reference specific timestamps and extract summaries, translations, or visual descriptions. Currently in preview, the feature permits processing up to 8 hours of video per day, with limitations of one public video per request. Gemini Pro processes videos up to two hours in length, while Gemini Flash handles videos up to one hour. The update follows the implementation of native image generation in Gemini.

Video: via Logan Kilpatrick

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

Bank transfer

Matthias Bastian

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.

Profile

E-Mail

AI in practice

Jul 8, 2025Jul 8, 2025

OpenAI and the American Federation of Teachers plan to train 400,000 U.S. teachers in AI

News, tests and reports about VR, AR and MIXED Reality.

What happens next with MIXED My personal farewell to MIXED Meta and Anduril are now jointly developing XR headsets for the US military MIXED-NEWS.com

AI in practice

Jul 8, 2025

Salesforce aims to control data flow as companies move toward agent-driven enterprise software

AI in practice

Jul 8, 2025Jul 8, 2025

OpenAI is ramping up security to prevent rivals from copying its advanced AI models

Google News

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Google's Gemini models add native video understanding

OpenAI and the American Federation of Teachers plan to train 400,000 U.S. teachers in AI

Salesforce aims to control data flow as companies move toward agent-driven enterprise software

OpenAI is ramping up security to prevent rivals from copying its advanced AI models

"Cat attack" on reasoning model shows how important context engineering is

Apple's claims about large reasoning models face fresh scrutiny from a new study

Cloudflare CEO Matthew Prince sees trouble ahead for the open web

Google's Gemini models add native video understanding

OpenAI and the American Federation of Teachers plan to train 400,000 U.S. teachers in AI

Salesforce aims to control data flow as companies move toward agent-driven enterprise software

OpenAI is ramping up security to prevent rivals from copying its advanced AI models