Open zhanghx0905 opened 1 month ago
the one realtime api use? or a separate one.
the one realtime api use? or a separate one.
Separate TTS and STT api,
@zhanghx0905 openai's STT/TTS is not stream based, they can only process files. so they are not that ideal in realtime cases.
@zhanghx0905 openai's STT/TTS is not stream based, they can only process files. so they are not that ideal in realtime cases.
You may take a look at the livekit-agent GitHub repository. I tried their OpenAI plugin and adapt it to Chinese. I found it works just like a streaming service.
By the way, I have locally deployed TTS (Text-to-Speech) / STT (Speech-to-Text) services. In order to integrate them into applications compatible with the OpenAI API, I wrapped them in the OpenAI API format. Therefore, I hope you will also consider these APIs.
I'm wondering if the project currently supports OpenAI's TTS and STT APIs, or if there are any plans to integrate them.