Ad
Skip to content

OpenAI's new Realtime API lets developers add realistic conversations to their apps

Image description
OpenAI (Screenshot)

OpenAI announced new features for app developers at its DevDay conference. The company is now offering its advanced speech synthesis technology for integration into third-party applications.

The new "Realtime API" lets developers add six AI voices to their apps. These voices are different from those used in ChatGPT. To avoid legal issues, developers can't use third-party voices.

OpenAI showed off a travel planning app using the Realtime API. Users could talk to an AI assistant about a London trip and get quick responses. The API can also add restaurant suggestions to maps.

The technology works for phone calls too, like placing orders. OpenAI doesn't automatically disclose it's an AI voice, leaving that up to developers for now.

Ad
DEC_D_Incontent-1

New GPT-4o features and cost savings

OpenAI also announced that developers can now use images to fine-tune GPT-4o. With just 100 example images, the model's performance can be improved for specific visual tasks.

A new prompt caching feature aims to reduce costs and latency. By reusing recently seen input tokens, developers can get a 50 percent discount and faster processing times.

Prompt caching is automatically applied to the latest versions of GPT-4o, GPT-4o mini, o1-preview and o1-mini, as well as fine-tuned versions of these models.

"Model distillation" allows smaller models like GPT-4o mini to be optimized using outputs from larger models. OpenAI is providing new integrated tools for this, including saved completions and evaluation options.

Ad
DEC_D_Incontent-2

OpenAI is doubling the rate limit for its new o1 model. To help developers get started, the company is offering free training quotas for GPT-4o and GPT-4o mini until the end of October.

AI News Without the Hype – Curated by Humans

As a THE DECODER subscriber, you get ad-free reading, our weekly AI newsletter, the exclusive "AI Radar" Frontier Report 6× per year, access to comments, and our complete archive.

AI news without the hype
Curated by humans.

  • Over 20 percent launch discount.
  • Read without distractions – no Google ads.
  • Access to comments and community discussions.
  • Weekly AI newsletter.
  • 6 times a year: “AI Radar” – deep dives on key AI topics.
  • Up to 25 % off on KI Pro online events.
  • Access to our full ten-year archive.
  • Get the latest AI news from The Decoder.
Subscribe to The Decoder