Incorrect transcription of single-word or short audio, and autocorrection in long audio

xenova / whisper-web

ML-powered speech recognition directly in your browser

MIT License

Line breaks between paragraphs are missing from the transcription, and single-word or short audio is not transcribed at all.

This is what is output by the model. If you'd like to segment by "paragraphs", you could first do some processing to split the audio into chunks by a certain amount of silence.
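
As a rough illustration of the silence-based splitting suggested above, here is a minimal sketch. It assumes the audio is already decoded to mono `Float32Array` samples. The function name `splitOnSilence`, the `Segment` type, and the default threshold and durations are all hypothetical and are not part of whisper-web or any library; they would need tuning for real recordings.

```typescript
// Hypothetical helper for illustration only; not part of whisper-web.
// A segment is a range of sample indices, with `end` exclusive.
interface Segment {
  start: number;
  end: number;
}

function splitOnSilence(
  samples: Float32Array,
  sampleRate: number,
  minSilenceSec = 0.7, // assumed: a pause this long marks a paragraph break
  threshold = 0.01,    // assumed: RMS level below which a frame is "silent"
  frameSec = 0.02,     // assumed: analysis frame length in seconds
): Segment[] {
  const frameLen = Math.max(1, Math.round(sampleRate * frameSec));
  const minSilentFrames = Math.ceil((minSilenceSec * sampleRate) / frameLen);

  const segments: Segment[] = [];
  let segStart = -1;     // start of the current speech segment, -1 if none is open
  let lastVoicedEnd = 0; // end of the most recent non-silent frame
  let silentRun = 0;     // consecutive silent frames seen inside a segment

  for (let off = 0; off < samples.length; off += frameLen) {
    const end = Math.min(off + frameLen, samples.length);

    // Root-mean-square energy of this frame.
    let sumSq = 0;
    for (let i = off; i < end; i++) {
      const v = samples[i] ?? 0;
      sumSq += v * v;
    }
    const rms = Math.sqrt(sumSq / (end - off));

    if (rms >= threshold) {
      if (segStart < 0) segStart = off;
      lastVoicedEnd = end;
      silentRun = 0;
    } else if (segStart >= 0) {
      silentRun++;
      if (silentRun >= minSilentFrames) {
        // The pause is long enough: close the segment at the last voiced frame.
        segments.push({ start: segStart, end: lastVoicedEnd });
        segStart = -1;
        silentRun = 0;
      }
    }
  }

  // Close a segment that runs to the end of the audio.
  if (segStart >= 0) segments.push({ start: segStart, end: lastVoicedEnd });
  return segments;
}
```

Each segment could then be transcribed separately, for example by passing `samples.subarray(seg.start, seg.end)` to the model, and the resulting texts joined with blank lines to get paragraph breaks.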

Please improve this. It also gets stuck on some words in a paragraph and repeats them in a loop.

Which model are you using?

There is also autocorrection, and filler sentences are being added. Please transcribe word by word so the result is more accurate. Whatever the user speaks should appear in the text, rather than auto-generated text.

I don't quite understand what you mean here.

Incorrect transcription of single-word or short audio, and autocorrection in long audio #20