AI in practice

AI-powered text-to-speech tool PlayHT offers custom voice cloning and collaboration features

Harry Verity

PlayHT

PlayHT is another AI-powered text-to-speech (TTS) tool that started life as a voice tool for Medium.

PlayHT’s main use case is for creating podcasts. It provides a wide range of features, including extensive language and voice selection.

One of PlayHT’s best features is that it brings together text-to-speech voices from Google, Amazon, IBM, and Microsoft in one API.

It supports 132 languages and an impressive 832 voices in total as well as MP3 & WAV Export for easy uploading to platforms like Spotify and Apple Podcasts.

A unique feature of PlayHT, custom voice cloning enables users to create custom voice models. These are based on specific audio inputs, bringing an added layer of personalization to the table​.

PlayHT's full support for Speech Synthesis Markup Language (SSML) allows users to control various aspects of speech such as emphasis, speed, and pauses, resulting in nuanced and human-like speech output.

PlayHT provides secure cloud storage for synthesized audio files, allowing users to manage their files with ease and confidence. The software also provides tools for team access and collaboration. It allows an entire team to collaborate, share and create audio files together.

Use Cases

PlayHT's AI-generated voices find applications in various fields. Marketers and content creators can utilize these voices for marketing, explainer, product, and YouTube videos​

The technology can be leveraged for e-learning purposes, narrating training materials with the correct pronunciation of terminologies and acronyms​.

Developers can integrate human-like voices into devices and applications using the provided API​​.

Businesses can enhance their customer service with professional voice interactions on IVR and telephony systems​.

Webmasters can improve accessibility and engagement on their websites by embedding SEO-friendly audio widgets​​.

PlayHT recently introduced its latest model, PlayHT Turbo, which can convert input text to speech extremely fast. The delay is said to be only 150 to 200 ms, i.e. perceived real time.

From Chrome Extension to AI App

PlayHT was started as a Chrome extension for listening to Medium articles in 2016. After gaining recognition on Product Hunt, the team saw an opportunity to evolve PlayHT into a tool for creating realistic audio content.

Today, it assists some of the largest companies globally in generating high-quality text-to-speech content for their applications​​. The company operates remotely and focuses on delivering high-quality text-to-speech synthesis and audio accessibility solutions​.

Pricing

PlayHT offers affordable plans starting from $9 per month for 10,000 words per month​ and $39 for 50,000 words a month with faster generations.

The highest tier point is $99 a month, which has the added benefit of 1 high-fidelity clone alongside 200,000 words a month.

Sources: