xenova / whisper-web

ML-powered speech recognition directly in your browser
https://hf.co/spaces/Xenova/whisper-web
MIT License
1.29k stars 152 forks

Incorrect transcription of single words or short audio, and autocorrection in long audio #20

Open Exoways123 opened 9 months ago

Exoways123 commented 9 months ago

Line breaks between paragraphs are missing from the transcript, and single-word or short audio is not transcribed at all. Please improve these things. Transcription also gets stuck on some words within a paragraph and repeats them in a loop. In addition, the output is autocorrected and filler sentences are added. Please transcribe word by word so the result is more accurate: whatever the user speaks should appear in the text, rather than auto-generated text.

xenova commented 9 months ago

> Line breaks between paragraphs are missing from the transcript, and single-word or short audio is not transcribed at all.

This is what the model outputs. If you'd like to segment by "paragraphs", you could first preprocess the audio and split it into chunks wherever there is a sufficiently long stretch of silence.
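For illustration, here is a minimal sketch of that kind of silence-based splitting. It is not part of whisper-web: `splitOnSilence`, the `Chunk` type, and the default values (a 0.5 s pause, an RMS threshold of 0.01, 20 ms frames) are all hypothetical and would need tuning for real recordings. It assumes the audio is already decoded to mono PCM samples in a `Float32Array`.

```typescript
// Hypothetical helper (not part of whisper-web): find speech chunks that are
// separated by at least `minSilenceSec` seconds of low-energy audio.
interface Chunk {
  start: number; // index of the first sample in the chunk
  end: number;   // index one past the last voiced sample
}

function splitOnSilence(
  samples: Float32Array,
  sampleRate: number,
  minSilenceSec = 0.5, // assumed pause length that marks a "paragraph" break
  threshold = 0.01,    // assumed RMS level below which a frame counts as silent
  frameSec = 0.02      // analysis frame length in seconds
): Chunk[] {
  const frameLen = Math.max(1, Math.round(sampleRate * frameSec));
  const minSilenceSamples = Math.round(sampleRate * minSilenceSec);

  const chunks: Chunk[] = [];
  let chunkStart = -1;   // start of the chunk being built, -1 if none
  let lastVoicedEnd = 0; // end of the most recent non-silent frame
  let silentSamples = 0; // length of the current run of silence

  for (let offset = 0; offset < samples.length; offset += frameLen) {
    const end = Math.min(offset + frameLen, samples.length);

    // Root-mean-square energy of this frame.
    let sumSquares = 0;
    for (let i = offset; i < end; i++) {
      sumSquares += samples[i] * samples[i];
    }
    const rms = Math.sqrt(sumSquares / (end - offset));

    if (rms >= threshold) {
      // Voiced frame: open a chunk if needed and reset the silence counter.
      if (chunkStart < 0) chunkStart = offset;
      lastVoicedEnd = end;
      silentSamples = 0;
    } else if (chunkStart >= 0) {
      // Silent frame inside a chunk: close the chunk once the pause is long enough.
      silentSamples += end - offset;
      if (silentSamples >= minSilenceSamples) {
        chunks.push({ start: chunkStart, end: lastVoicedEnd });
        chunkStart = -1;
        silentSamples = 0;
      }
    }
  }

  // Flush a chunk that runs to the end of the audio.
  if (chunkStart >= 0) chunks.push({ start: chunkStart, end: lastVoicedEnd });
  return chunks;
}
```

Each chunk could then be cut out with `samples.subarray(chunk.start, chunk.end)` and transcribed separately, with the results joined by blank lines to approximate paragraphs. Pauses shorter than `minSilenceSec` stay inside a chunk, so ordinary gaps between words do not trigger a split.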

> Please improve these things. Transcription also gets stuck on some words within a paragraph and repeats them in a loop.

Which model are you using?

> In addition, the output is autocorrected and filler sentences are added. Please transcribe word by word so the result is more accurate: whatever the user speaks should appear in the text, rather than auto-generated text.

I don't quite understand what you mean here.