xenova / whisper-web

ML-powered speech recognition directly in your browser
https://hf.co/spaces/Xenova/whisper-web
MIT License
1.29k stars 152 forks source link

Speech Recognition/Whisper word level scores or confidence output #32

Open wobbble opened 2 months ago

wobbble commented 2 months ago

Hey, Big thanks for awesome project!

It possible to add score/confidence for word level output when using Speech Recognition/Whisper model? Would appreciate any direction/comments or suggestion where to dig to add it. Happy to submit PR if I will success in it.

Thanks!

decoder-sh-david commented 1 week ago

Seconded, I have also not been able to successfully get word-level timestamps while running on webgpu. Would love to have both!