OpenAI推出低延迟语音交互的Realtime API公测版
原文英文,约500词,阅读约需2分钟。发表于: 。OpenAI launched the public beta of the Realtime API, offering developers the ability to create low-latency, multimodal voice interactions within their applications. Additionally, audio...
OpenAI推出了Realtime API公测版,支持低延迟、多模态语音交互,简化对话应用开发。Chat Completions API新增音频功能,适合不需低延迟的场景。Realtime API通过WebSocket支持实时对话,但语音选项有限。音频输入每分钟$0.06,输出$0.24,长时间使用成本较高。