Ad
Skip to content

Google's updated Gemini 2.5 Flash Native Audio handles complex voice tasks better

Google has released an update for Gemini 2.5 Flash Native Audio that makes voice assistants more capable. The model now handles complex workflows better, follows user instructions more precisely, and conducts more natural conversations. Compliance with developer instructions jumped from 84 to 90 percent, and call quality in multi-step conversations has also improved.

According to Google, the updated audio model scores 71.5 percent accuracy on function calls in the ComplexFuncBench benchmark, putting it ahead of OpenAI's gpt-realtime at 66.5 percent. It's worth noting, though, that Google likely didn't test against the latest realtime version, which OpenAI released just yesterday.

The update is now available in Google AI Studio, Vertex AI, Gemini Live, and Search Live. Google Cloud customers are already using the technology, and developers can test the model through the Gemini API.

AI News Without the Hype – Curated by Humans

Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.

Read on for the full picture.
Subscribe for hype-free coverage.

  • Access to all THE DECODER articles.
  • Read without distractions – no Google ads.
  • Access to comments and community discussions.
  • Weekly AI newsletter.
  • 6 times a year: “AI Radar” – deep dives on key AI topics.
  • Up to 25 % off on KI Pro online events.
  • Access to our full ten-year archive.
  • Get the latest AI news from The Decoder.
Subscribe to The Decoder