
Google expands "Audio Overviews" to 75 languages using Gemini-based audio production

NotebookLM’s "Audio Overviews" feature is now available in approximately 75 languages, including less commonly spoken ones such as Icelandic, Basque, and Latin. The audio for each language is generated by AI agents using "metaprompting," with the Gemini 2.5 Pro language model as the underlying system. At the same time, Google is moving to an audio production technology based entirely on Gemini’s multimodality, a development that does not bode well for providers focused exclusively on audio models.

As with AI-generated text, audio created by language models can also contain inaccuracies. This issue is especially pronounced in AI-generated podcasts, where large amounts of audio may be produced from minimal source text, and the conversion from text to dialogue constitutes a significant alteration of the original material.
