OpenAI launched three new audio models that can reason, translate across 70+ languages, and transcribe speech in real time, ...
GPT‑Realtime‑Whisper is a new streaming transcription model built for low-latency speech-to-text. It transcribes audio as ...
Translate, and Realtime-Whisper split voice into discrete models, reducing the orchestration overhead that has made ...
OpenAI has rolled out a new set of real-time audio models focused on making voice AI faster and more useful in live ...
The three are GPT-Realtime-2, a successor to the company’s existing realtime voice model with what OpenAI describes as GPT-5-class reasoning; GPT-Realtime-Translate, a live translation model with more ...
The new lineup includes GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. All three are available now through ...
OpenAI releases three new audio models for the API: GPT-Realtime-2 for real-time conversations, Translate for translations, ...
What’s new: OpenAI released GPT‑Realtime‑Translate, alongside two other voice models, for real-time multilingual speech translation. Why it matters: The tool could help bridge communication gaps for ...
What’s new: OpenAI released three voice AI models with real-time reasoning, translation, and transcription capabilities, aiming to make conversations more interactive and task-oriented. Who’s testing: ...
Google brought its Gemini AI models to Google Translate in August, both to improve the translation experience for users and to add new features to the app, including support for real-time translation ...
GPT-Realtime-2 is OpenAI’s first voice model with GPT-5-class reasoning designed for live conversational use cases. It ...