OpenAI releases new models for its Realtime API

OpenAI has updated its Realtime API with three new model snapshots designed to improve transcription, speech synthesis, and function calling. According to developers, the gpt-4o-mini-transcribe variant significantly reduces hallucinations. For text-to-speech tasks, gpt-4o-mini-tts cuts the word error rate by 35 percent. The gpt-realtime-mini model, which targets voice assistants, follows instructions 22 percent more accurately and improves function calling by 13 percent.

OpenAI also explicitly mentioned improvements for Chinese, Japanese, Indonesian, Hindi, Bengali, and Italian.

