OpenAI Releases Three Realtime Audio Models: GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper in the Realtime API
OpenAI has released three new audio models for its Realtime API, each targeting a different capability for live voice applications: GPT-Realtime-2 for intelligent voice agents, GPT-Realtime-Translate for live speech translation, and GPT-Realtime-Whisper for transcribed streams. Alongside the release of the model, the Realtime API is officially out of beta and now generally available – a … Read more