gpt-realtime: новая голосовая модель от OpenAI

Q: Кто разработал gpt-realtime?

Модель gpt-realtime разработана компанией OpenAI (United States of America).

Q: Какие задачи решает gpt-realtime?

Распознавание речи, Speech synthesis, Визуальные ответы на вопросы, Speech-to-speech, Audio question answering

// задачи

Распознавание речиSpeech synthesisВизуальные ответы на вопросыSpeech-to-speechAudio question answering

// описание

Новая модель gpt-realtime от OpenAI выводит голосовое взаимодействие с ИИ на уровень живого общения с минимальной задержкой. Она безупречно следует сложным инструкциям и синтезирует естественную, эмоциональную речь, что делает её идеальным инструментом для создания продвинутых AI-ассистентов.

// abstract

We’re also releasing our most advanced speech-to-speech model yet—gpt-realtime. The new model shows improvements in following complex instructions, calling tools with precision, and producing speech that sounds more natural and expressive. It’s better at interpreting system messages and developer prompts—whether that’s reading disclaimer scripts word-for-word on a support call, repeating back alphanumerics, or switching seamlessly between languages mid-sentence. We’re also releasing two new voices, Cedar and Marin, which are available exclusively in the Realtime API starting today.

// faq

Что такое gpt-realtime?+

Кто разработал gpt-realtime?+

Какие задачи решает gpt-realtime?+

// похожие модели

Emu3.5

Beijing Academy of Artificial Intelligence / BAAI

34.1B

Gemini 2.5 Computer Use

Google

Octave 2

Hume