OpenAI launched three new audio models that can reason, translate across 70+ languages, and transcribe speech in real time, ...
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a ...
The new lineup includes GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. All three are available now through ...
OpenAI launched three real-time voice models, bringing GPT-5-class reasoning, 70-language translation, and live transcription ...
New AI trio: OpenAI unveiled three voice models for real-time reasoning, translation, and transcription, expanding its developer tools beyond text-based ChatGPT. Early adopters: Zillow, Priceline, and ...
There has always been one glaring issue with voice AI demos. They seem like magic until something too complicated is thrown at ...
The three are GPT-Realtime-2, a successor to the company’s existing realtime voice model with what OpenAI describes as GPT-5-class reasoning; GPT-Realtime-Translate, a live translation model with more ...
The OpenAI ChatGPT Realtime API, now available in public beta, is transforming how developers create low-latency, multimodal applications. By seamlessly integrating speech, text, and function calling ...
What if your next phone call with customer support didn’t feel like a frustrating maze of robotic prompts but instead like a natural, empathetic conversation? Imagine an AI that not only understands ...
Agora's Conversational AI Engine offers key enhancements to the Realtime API for more natural communication and interaction. This milestone builds on Agora's partnership with OpenAI, as the Realtime ...