summary Summary

Supertone AI is an expressive text-to-audio platform that claims to take on the publishing industry.


Beyond text-to-speech

Supertone's technology represents a leap forward in traditional text-to-speech synthesis. The company has developed a proprietary speech synthesis technology called Nuvo.

This not only speaks and sings, but also captures emotions. This technology enables the production of voices that are virtually indistinguishable from human voices.

Supertone's Controllable Voice Conversion (CVC) technology allows users to convert their voice into any other voice in real-time. This means that users can transform their voice into that of a man, an old person, a woman, or even a child, playing multiple characters as needed.


It's not just voice cloning like Eleven Labs or Descript, it's voice augmentation.

This makes the text-to-speech platform perfect for audiobooks, where different voice types are needed for different characters.

Recent results include this performance of the voice of a deceased South Korean pop star.

Or this performance of Freddie Mercury in Korean.

Customizing Voices

Supertone Nuvo also gives users the option to customize their voices. There are pre-selected base voices that users can select and design for further customization. This technology uses neural networks trained on unsorted data of the target voice to produce raw audio, providing a natural voice synthesis process.


Product Lineup

Supertone AI is a suite of tools including Nest, the Voice Gene Designer, Voice Content Creator, Real-Time Voice Converter, and Real-Time Voice Separator.

One of the standout products, the GOYO Voice Separator, is a post-production plugin that can de-noise, de-reverb, and de-voice audio by separating it into three channels with full control over the audio elements. It's a cut above its competitors, Eleven Labs, Lovo by Genny and Murf.


Detailed pricing for paid plans is not readily available. This is because Supertone voice synthesis technology is only available to authorized partners. Voice synthesis turnaround times may vary depending on the scope of the project.

To work with Supertone AI, interested users must first submit a business proposal to the team. Currently, the tool is only available to those who meet the company's criteria. Once reviewed by the team, the user and the company sign an agreement to move forward. This avoids some of the deepfake issues that other platforms face.

Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

The founders

Supertone AI was founded in March 2020 by Choi Hee-doo, Huh Hoon, Hyungseok Choi, Kim Wansoo, Kyogu Lee, and Lee Juheon from South Korea.

The company's goal is to provide creators with an innovative voice content production environment, from idea generation to post-production.

Support our independent, free-access reporting. Any contribution helps and secures our future. Support now:
Bank transfer
  • Supertone AI offers advanced, expressive text-to-audio technology with an audiobook use case. They use their own model called Nuvo.
  • Features include customization, real-time conversion, and voice separation.
  • Access is limited to authorized partners who must submit a business plan.
Journalist and published fiction author Harry is leveraging AI tools to bring his stories to life in new ways. He is currently working on making the first entirely AI generated movies from his novels and has a serialised story newsletter illustrated by Midjourney.
Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.