whisper : add WER tests

ggerganov / whisper.cpp

Port of OpenAI's Whisper model in C/C++

MIT License

35.38k stars 3.61k forks source link

whisper : add WER tests #2454

Open ggerganov opened 3 weeks ago

ggerganov commented 3 weeks ago

It would be nice to start measuring the word error rate (WER) of whisper.cpp across some representative dataset:

short audio
long audio
english
non-english
etc.

This will help us catch regressions in the future. I'm not familiar with what is typically used for TTS WER benchmarks, so looking for help from the community.