For better transcription in more languages, Implement Massively Multilingual Speech - Meta's Open Source model with less than half of Whispers error rate #4
Because of the error rate viz and above al speaker detection your whisper ui is better for research use than all the others I have tried. Please consider implementing Meta's MMS with speech recognition and generation support for over 1000 languages at a drastically reduced error rate compared to Whisper:
Because of the error rate viz and above al speaker detection your whisper ui is better for research use than all the others I have tried. Please consider implementing Meta's MMS with speech recognition and generation support for over 1000 languages at a drastically reduced error rate compared to Whisper:
https://github.com/facebookresearch/fairseq/tree/main/examples/mms
https://ai.facebook.com/blog/multilingual-model-speech-recognition/