OpenAI released a new generation of voice models in its API on Wednesday, giving developers tools to build apps that can reason through spoken requests, translate across +70 languages, and transcribe ...
OpenAI is adding three voice models to its Realtime API, giving developers tools for live reasoning, speech translation, and streaming transcription, the company said. The first model, GPT-Realtime-2, ...
GPT-Realtime-2 brings GPT-5-class reasoning to live voice. A separate translation model covers 70+ input languages. A streaming Whisper variant handles transcription. The pricing is aggressive enough ...
OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users ...
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Erik Steiger discusses the operational pain ...
What’s new: OpenAI released three voice AI models with real-time reasoning, translation, and transcription capabilities, aiming to make conversations more interactive and task-oriented. Who’s testing: ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results