Open peperunas opened 1 year ago
It's interesting, but this would require quite a bit of backend work. We already support microphone input, and it works quite well. I prefer it to the built-in speech recognition in Windows.
I'm gonna park this for now and close it, as it will probably make the app much bulkier and different. Will reopen if things change :)
Whisper is supported in LocalAI, so I hope an upload-and-transcribe function can be implemented. The API is well documented at https://platform.openai.com/docs/guides/speech-to-text/prompting , and the doc also covers how to split long audio into chunks smaller than 25 MB. For now I can only use the command line or a Python notebook to do this.
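For reference, here is a rough sketch of the notebook workflow, not a definitive implementation. `plan_chunks` and `transcribe` are hypothetical helper names. The chunk planner assumes a roughly constant bitrate (so file size scales with duration), and the 10% safety margin is an arbitrary choice. The transcription call uses the `openai` Python package; passing a `base_url` to point it at a LocalAI server is an assumption about that setup, and the client still expects an API key in the `OPENAI_API_KEY` environment variable.

```python
import math

MAX_UPLOAD_BYTES = 25 * 1024 * 1024  # the 25 MB upload limit mentioned above


def plan_chunks(duration_s, size_bytes, max_bytes=MAX_UPLOAD_BYTES, safety=0.9):
    """Return (start, end) ranges in seconds so each chunk fits the upload limit.

    Assumes a roughly constant bitrate, so size scales linearly with duration.
    """
    if duration_s <= 0 or size_bytes <= 0:
        return []
    budget = max_bytes * safety  # leave headroom below the hard limit
    n_chunks = max(1, math.ceil(size_bytes / budget))
    step = duration_s / n_chunks
    return [(i * step, min((i + 1) * step, duration_s)) for i in range(n_chunks)]


def transcribe(path, model="whisper-1", base_url=None):
    """Send one audio file to an OpenAI-compatible transcription endpoint."""
    # Imported lazily so the chunk planner works without the package installed.
    from openai import OpenAI

    # base_url=None uses the default OpenAI endpoint; a LocalAI URL is an assumption.
    client = OpenAI(base_url=base_url)
    with open(path, "rb") as audio:
        result = client.audio.transcriptions.create(model=model, file=audio)
    return result.text
```

The planner only computes time ranges; actually cutting the audio at those boundaries needs a separate tool such as ffmpeg or pydub, which is not shown here. Each resulting file could then be passed to `transcribe` and the texts joined in order.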
Could we reopen this? I think Whisper from OpenAI is easy to integrate and would not change the UX: same record button. The key is the amazing accuracy. I switched to Whisper on my Mac and won't look back.
ok
I propose integrating OpenAI's Whisper Automatic Speech Recognition (ASR) system [GitHub]. Whisper is designed to convert spoken language into written text.
Is it something that might be of interest?