jitsi / gsoc-ideas

Google Summer of Code ideas

Speech to Text #30

Closed by Rishabhk06 6 months ago

Rishabhk06 commented 2 years ago

Hey... I was exploring the Speech to Text project idea for GSoC 2022 and came across the JavaScript Web Speech API, which offers speech recognition. It seems quite accurate and works in real time, but its privacy policy is not very clear. It does state somewhere, however:

Chrome currently takes the audio and sends it to Google's servers to perform the transcription.

I wanted to know your views on this and whether it suits your requirements. I also tested the open-source DeepSpeech library on a local setup: the results are good enough for English but not very convincing for other regional languages. Do you consider it a viable solution? Looking forward to suggestions and feedback, @nikvaessen.

Regards Rishabh

nikvaessen commented 2 years ago

There are 2 issues with the Web Speech API:

1) We want a solution that can be completely self-hosted; no audio should be sent to a third party.
2) The solution needs to run server-side, as implemented in Jigasi.
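
To make the two requirements concrete, here is a rough Python sketch of transcription that stays on the host, using the DeepSpeech package mentioned earlier in the thread. It is only an illustration and is not how Jigasi does it. The helper names `read_pcm16_mono` and `transcribe_locally`, and the file paths in the usage note, are hypothetical. It assumes the `deepspeech` and `numpy` packages are installed and that a model file has been downloaded separately.

```python
import wave


def read_pcm16_mono(path, expected_rate=16000):
    """Return the raw 16-bit mono PCM bytes of a WAV file, checking its format."""
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono audio")
        if wav.getframerate() != expected_rate:
            raise ValueError(
                f"expected {expected_rate} Hz audio, got {wav.getframerate()} Hz"
            )
        return wav.readframes(wav.getnframes())


def transcribe_locally(wav_path, model_path):
    """Transcribe a WAV file on this machine; the audio is never sent elsewhere."""
    # Imported lazily so the WAV helper above works without these packages.
    import numpy as np
    from deepspeech import Model

    model = Model(model_path)  # loads the acoustic model from local disk
    pcm = read_pcm16_mono(wav_path, expected_rate=model.sampleRate())
    audio = np.frombuffer(pcm, dtype=np.int16)  # DeepSpeech expects int16 samples
    return model.stt(audio)
```

Something like `transcribe_locally("meeting.wav", "deepspeech-model.pbmm")` (both paths are placeholders) would then run entirely server-side. A real integration would more likely feed audio incrementally through DeepSpeech's streaming interface than load whole files.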