mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Mozilla Public License 2.0
25.39k stars 3.97k forks source link

Find Word Error Rate? #3676

Open Chaitanya1597 opened 3 years ago

Chaitanya1597 commented 3 years ago

Currently, I'm using DeepSpeech 0.9.3 and trying to find out Word Error Rate on the Common Voice Mozilla dataset. My question is, is there any python script to find out WER? if Yes, please help me with this.

yunzqq commented 3 years ago

I also want to know how could I calculate the WER?

RobinE89 commented 3 years ago

@Chaitanya1597 and @yunzqq - what you are looking for is the metric called - "Levenshtein Distance". You have to calculate this between the Ground-Truth-Text (alias Referenztext (the origin voice input that you made as text)) and the Deepspeech - Transcription (output text) more useful informations: Articel from Rafael C. Carrasco - i hope this is helpful for you?