The new features could be handy for customer service systems, but OpenAI says they have applications that work across a variety of other fields, including education and creator platforms.
OpenAI今天在API中推出三款全新音频模型,面向开发者开放。 这三款模型分别是: GPT-Realtime-2:首款具备GPT-5级别推理能力的语音模型,能处理更复杂的请求,并自然地推进对话。 GPT-Realtime-Translate:实时翻译模型,支持70多种输入语言翻译成13种输出语言,翻译 ...
GPT-Realtime-Whisper 是一种专为低延迟语音转文本而构建的新型流式转录模型。它能在人们说话的同时转录音频,从而使实时产品感觉更快、响应更灵敏、更自然——从即时显示的字幕到与对话同步的会议记录。
GPT-Realtime-2 brings GPT-5-class reasoning to live voice. A separate translation model covers 70+ input languages. A streaming Whisper variant handles transcription. The pricing is aggressive enough ...
OpenAI has introduced three new real-time voice models—GPT‑Realtime‑2 for reasoning, GPT‑Realtime‑Translate for live translation, and GPT‑Realtime‑Whisper for transcription—via its Realtime API. The ...
May 7 (Reuters) - OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based software agents more conversational and capable of completing tasks in ...
What’s been launched: OpenAI released GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper via its API, adding advanced reasoning, live translation, and instant transcription capabilities.
Credit: VentureBeat made with GPT-Image-1.5 on fal.ai Until recently, the practice of building AI agents has been a bit like training a long-distance runner with a thirty-second memory. Yes, you could ...
GPT-5.5 lands seven weeks after the release of GPT-5.4, which arrived on March 5. OpenAI says the newest model “understands what you’re trying to do faster” and that it can “carry more of the work ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果