The new model supports more than 30 languages, seven major Chinese dialects and over 20 regional accents A new artificial intelligence voice model from Alibaba Group Holding has beaten out Western ...
StepFun, the Shanghai lab that builds LLMs that punch above their weight, just turned that same energy on voice.
AI voice agents are getting closer to doing more than waiting their turn to speak. OpenAI announced Thursday that it is expanding its Realtime API with GPT-Realtime-2, a new voice ...
OpenAI has introduced three new realtime voice AI models, which are designed to help developers create smarter and more natural voice-based applications. The new models focus on live conversations, ...
GPT-Realtime-2 brings GPT-5-class reasoning to live voice. A separate translation model covers 70+ input languages. A streaming Whisper variant handles transcription. The pricing is aggressive enough ...
OpenAI has released three new audio models—GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper—built for real-time conversation, translation, and transcription. The models are available ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
Discover the best text-to-speech AI voice generators of 2025, offering natural voices and powerful features for personal and business use.
Is AI leaving the era of "turn-based" chat? Right now, all of us who use AI models regularly for work or in our personal lives know that the basic interaction mode across text, imagery, audio, and ...
Translate, and Realtime-Whisper split voice into discrete models, reducing the orchestration overhead that has made enterprise voice agents costly to deploy.
Tool Calling Designed for More Reliability, GPT-5-Class Reasoning, and Improved Transcription Are Available Now in Early Availability 8x8, Inc. (NASDAQ: EGHT), a leading global business communications ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results