GPT‑Realtime‑Whisper is a new streaming transcription model built for low-latency speech-to-text. It transcribes audio as ...
What’s new: OpenAI launched GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper, enhancing voice AI in reasoning, translation, and transcription. Why it matters: The models enable more ...
The three are GPT-Realtime-2, a successor to the company’s existing realtime voice model with what OpenAI describes as GPT-5-class reasoning; GPT-Realtime-Translate, a live translation model with more ...
What if your next phone call with customer support didn’t feel like a frustrating maze of robotic prompts but instead like a natural, empathetic conversation? Imagine an AI that not only understands ...
There has always been one glaring issue with Voice AI demos. It seems like magic until something too complicated is thrown at ...
GPT-Realtime-2 is OpenAI’s first voice model with GPT-5-class reasoning designed for live conversational use cases. It ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
May 7 (Reuters) - OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based ...
OpenAI announced its most advanced speech-to-speech AI model yet, GPT-Realtime. The new model, now available through OpenAI’s updated Realtime API, is said to be more reliable and cheaper than the ...
OpenAI's Realtime API is now optimized and generally available. You can try its latest speech-to-speech model, gpt-realtime. The upgrades improve OpenAI's voice ...
The new API features will help enterprises build autonomous, multimodal voice agents with remote tool access, PBX integration, and enhanced context awareness. OpenAI has added remote model context ...