OpenAI expands AI capabilities with new audio models for voice agents
ChatGPT maker OpenAI has now launched new speech-to-text and text-to-speech audio models in API to enhance voice agents. OpenAI stated that these new models, “set a new state-of-the-art benchmark, outperforming existing solutions in accuracy and reliability—especially in challenging scenarios involving accents, noisy environments, and varying speech speeds.”
These enhancements improve transcription accuracy, making the models particularly effective for applications such as customer service call centres, meeting note-taking, and other similar use cases.