Amazon has introduced Nova Sonic, a new AI voice model designed to process speech natively and generate natural-sounding responses. The model reportedly matches leading speech models from OpenAI and Google on key metrics such as speed, speech recognition, and call quality. Amazon has made Nova Sonic available through its Bedrock developer platform at a price it claims is 80% lower than that of OpenAI's GPT-4o, though OpenAI also offers a more affordable option, GPT-4o mini. Some components of Nova Sonic are already integrated into Amazon's Alexa+ service. According to Rohit Prasad, SVP and Chief Scientist for AGI at Amazon, the model stands out for its ability to recognize speech in challenging conditions and to route user requests efficiently to various APIs.