AI research

Jun 18, 2023Jun 18, 2023

Metas Voicebox is Stable Diffusion for speech

Max is the managing editor of THE DECODER, bringing his background in philosophy to explore questions of consciousness and whether machines truly think or just pretend to.

Profile

E-Mail

Meta's Voicebox is like Stable Diffusion for voices: The generative AI model synthesizes speech from text and can be used for various speech tasks. Voicebox generates realistic and expressive voices and allows attributes such as tone, style or accent to be adopted from audio files.

According to Meta, Voicebox outperforms existing speech synthesis models such as Microsoft's VALL-E in terms of speech quality and naturalness. "As the first versatile, efficient model that successfully performs task generalization, we believe Voicebox could usher in a new era of generative AI for speech.," Meta said. Due to the risk of misuse, the team has also developed a system for recognizing synthesized speech and has no plans to release Voicebox for the time being.

Video: Meta

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:

Bank transfer

Sources

Some Meta employees fear being sidelined as Zuckerberg reshuffles teams for AI progress

News, tests and reports about VR, AR and MIXED Reality.

What happens next with MIXED My personal farewell to MIXED Meta and Anduril are now jointly developing XR headsets for the US military MIXED-NEWS.com

AI and society

Jul 3, 2025Jul 3, 2025

Meta tests chatbots with proactive messaging to boost retention

AI in practice

Jun 12, 2025Jun 12, 2025

Meta launches AI video editing but holds back on full features for now

Google News

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Metas Voicebox is Stable Diffusion for speech

Some Meta employees fear being sidelined as Zuckerberg reshuffles teams for AI progress

Meta tests chatbots with proactive messaging to boost retention

Meta launches AI video editing but holds back on full features for now

OpenAI launches GPT-5 as a unified system with adaptive reasoning for complex tasks

Google Deepmind's Genie 3 creates interactive 3D worlds that stay consistent for "multiple minutes"

Google upgrades Gemini with Deep Think and flags early warning risks

Metas Voicebox is Stable Diffusion for speech

Some Meta employees fear being sidelined as Zuckerberg reshuffles teams for AI progress

Meta tests chatbots with proactive messaging to boost retention

Meta launches AI video editing but holds back on full features for now