Resemble AI has released Chatterbox, a free open-source voice cloning model that runs locally and supports emotional tone control like "dramatic" or "monotone." It clones voices using just a few seconds of audio and responds in under 200 milliseconds. The tool works on Windows, Mac, and Linux with 5–6 GB of video memory. All generated speech includes a faint watermark, "PerTh," to identify it as AI-made. According to Resemble AI, it performed better than ElevenLabs in blind tests. Currently, it only supports English.
Decoder EN demo (heightened emotional expression)
Chatterbox is released under the MIT license and is aimed at developers. A demo is available online.