Ad
Skip to content

Chatterbox is a free open-source voice cloning model with emotional tone control

Resemble AI has released Chatterbox, a free open-source voice cloning model that runs locally and supports emotional tone control like "dramatic" or "monotone." It clones voices using just a few seconds of audio and responds in under 200 milliseconds. The tool works on Windows, Mac, and Linux with 5–6 GB of video memory. All generated speech includes a faint watermark, "PerTh," to identify it as AI-made. According to Resemble AI, it performed better than ElevenLabs in blind tests. Currently, it only supports English.

Decoder EN demo (heightened emotional expression)

Chatterbox is licensed under MIT and targets developers. Check out the demo here.

Ad
DEC_D_Incontent-1

AI News Without the Hype – Curated by Humans

As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.

Source: Github