Google has released an update for Gemini 2.5 Flash Native Audio that makes voice assistants more capable. The model now handles complex workflows better, follows user instructions more precisely, and conducts more natural conversations. Compliance with developer instructions jumped from 84 to 90 percent, and call quality in multi-step conversations has also improved.

Ad

According to Google, the updated audio model scores 71.5 percent accuracy on function calls in the ComplexFuncBench benchmark, putting it ahead of OpenAI's gpt-realtime at 66.5 percent. It's worth noting, though, that Google likely didn't test against the latest realtime version, which OpenAI released just yesterday.

The update is now available in Google AI Studio, Vertex AI, Gemini Live, and Search Live. Google Cloud customers are already using the technology, and developers can test the model through the Gemini API.

Ad
Ad
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
Sources
Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.