vercel / ai

Build AI-powered applications with React, Svelte, Vue, and Solid
https://sdk.vercel.ai/docs

Text to Speech utils? #885

Open nabilfatih opened 6 months ago

nabilfatih commented 6 months ago

Feature Description

I'd love to see the AI SDK handle text-to-speech (TTS) from OpenAI. According to the documentation, TTS output can be streamed: https://platform.openai.com/docs/guides/text-to-speech/streaming-real-time-audio
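To make the request concrete, here is a rough sketch of consuming the streamed audio by calling OpenAI's `/v1/audio/speech` REST endpoint directly with `fetch`. This is not an AI SDK API. The helper name `streamSpeech`, the `FetchLike` type, the injectable `fetchImpl` parameter, and the choice of the `tts-1` model and `alloy` voice are all assumptions made for illustration.

```typescript
// Minimal sketch: read OpenAI text-to-speech audio chunk by chunk.
// Assumptions (not from this thread): the helper name, the FetchLike type,
// the "tts-1" model, the "alloy" voice, and the injectable fetch parameter.

type FetchLike = (
  url: string,
  init: { method: string; headers: Record<string, string>; body: string },
) => Promise<{
  ok: boolean;
  status: number;
  body: ReadableStream<Uint8Array> | null;
}>;

async function* streamSpeech(
  text: string,
  apiKey: string,
  fetchImpl: FetchLike = fetch as unknown as FetchLike,
): AsyncGenerator<Uint8Array> {
  const response = await fetchImpl("https://api.openai.com/v1/audio/speech", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ model: "tts-1", voice: "alloy", input: text }),
  });

  if (!response.ok || !response.body) {
    throw new Error(`TTS request failed with status ${response.status}`);
  }

  // Yield audio bytes as they arrive instead of buffering the whole file.
  const reader = response.body.getReader();
  try {
    while (true) {
      const { done, value } = await reader.read();
      if (done) break;
      if (value) yield value;
    }
  } finally {
    reader.releaseLock();
  }
}
```

Each yielded chunk could be forwarded to the client from a route handler so playback can start before the full audio is generated. The API key should stay on the server.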

Use Case

A chatbot with speech, like the ChatGPT mobile app.

Additional context

No response

01582 commented 6 months ago

This would need an integration with Google Cloud APIs or a pre-hosted server.

01582 commented 6 months ago

You could do that with https://replicate.com/ and an open-source text-to-speech project.

01582 commented 6 months ago

I don't think Vercel would add text-to-speech utils.

llermaly commented 5 months ago

Yes, we will have it: https://github.com/vercel/ai/pull/922

Now, @lgrammel, I wonder if we could also add speech-to-text (STT) for the text input to make it end-to-end conversational.

I'm experimenting with https://github.com/JamesBrill/react-speech-recognition