Chatterbox is a free open-source voice cloning model with emotional tone control

Jun 19, 2025

Resemble AI has released Chatterbox, a free open-source voice cloning model that runs locally and supports emotional tone control like "dramatic" or "monotone." It clones voices using just a few seconds of audio and responds in under 200 milliseconds. The tool works on Windows, Mac, and Linux with 5–6 GB of video memory. All generated speech includes a faint watermark, "PerTh," to identify it as AI-made. According to Resemble AI, it performed better than ElevenLabs in blind tests. Currently, it only supports English.

Decoder EN demo (heightened emotional expression)

Chatterbox is licensed under MIT and targets developers. Check out the demo here.

AI News Without the Hype – Curated by Humans

Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.

AI news without the hype
Curated by humans.

More than 16% discount.
Read without distractions – no Google ads.
Access to comments and community discussions.
Weekly AI newsletter.
6 times a year: “AI Radar” – deep dives on key AI topics.
Up to 25 % off on KI Pro online events.
Access to our full ten-year archive.
Get the latest AI news from The Decoder.

Subscribe to The Decoder

Chatterbox is a free open-source voice cloning model with emotional tone control

AI News Without the Hype – Curated by Humans

AI news without the hypeCurated by humans.

AI news without the hype
Curated by humans.