Open dazWiLLiE opened 2 months ago
As a temporary fix, try the following:
In the audiotext-v2.3.0 folder, go to _internal > whisperx > alignment.py and, after the line
"ro": "anton-l/wav2vec2-large-xlsr-53-romanian"
add:
"sv": "KBLab/wav2vec2-large-voxrex-swedish"
Don't forget to add a comma at the end of the "ro"... line.
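To see why the comma matters, here is a small sketch that parses the tail of the table with and without it. The helper name parses_as_dict and the two string constants are made up for illustration; they are not part of whisperx or Audiotext.

```python
import ast

# Tail of the table with the comma after the "ro" entry.
WITH_COMMA = """{
    "ro": "anton-l/wav2vec2-large-xlsr-53-romanian",
    "sv": "KBLab/wav2vec2-large-voxrex-swedish"
}"""

# Same two entries, but the comma after the "ro" entry is missing.
WITHOUT_COMMA = """{
    "ro": "anton-l/wav2vec2-large-xlsr-53-romanian"
    "sv": "KBLab/wav2vec2-large-voxrex-swedish"
}"""


def parses_as_dict(source: str) -> bool:
    """Hypothetical helper: True if `source` is a valid Python dict literal."""
    try:
        return isinstance(ast.literal_eval(source), dict)
    except (SyntaxError, ValueError):
        return False


if __name__ == "__main__":
    print("with comma:", parses_as_dict(WITH_COMMA))
    print("without comma:", parses_as_dict(WITHOUT_COMMA))
```

Without the comma, Python joins the two adjacent string literals and then hits the second colon, so the file would fail to load with a SyntaxError instead of just missing the "sv" entry.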
Thank you. I'll try it right away.
Now I got:
Traceback (most recent call last):
File "handlers\whisperx_handler.py", line 53, in transcribe_file
File "whisperx\alignment.py", line 71, in load_align_model
Please find a wav2vec2.0 model finetuned on this language in https://huggingface.co/models, then pass the model name in --align_model [MODEL_NAME]")
ValueError: No default align-model for language: sv
Edit:
alignment.py
DEFAULT_ALIGN_MODELS_HF = {
"ja": "jonatasgrosman/wav2vec2-large-xlsr-53-japanese",
"zh": "jonatasgrosman/wav2vec2-large-xlsr-53-chinese-zh-cn",
"nl": "jonatasgrosman/wav2vec2-large-xlsr-53-dutch",
"uk": "Yehor/wav2vec2-xls-r-300m-uk-with-small-lm",
"pt": "jonatasgrosman/wav2vec2-large-xlsr-53-portuguese",
"ar": "jonatasgrosman/wav2vec2-large-xlsr-53-arabic",
"cs": "comodoro/wav2vec2-xls-r-300m-cs-250",
"ru": "jonatasgrosman/wav2vec2-large-xlsr-53-russian",
"pl": "jonatasgrosman/wav2vec2-large-xlsr-53-polish",
"hu": "jonatasgrosman/wav2vec2-large-xlsr-53-hungarian",
"fi": "jonatasgrosman/wav2vec2-large-xlsr-53-finnish",
"fa": "jonatasgrosman/wav2vec2-large-xlsr-53-persian",
"el": "jonatasgrosman/wav2vec2-large-xlsr-53-greek",
"tr": "mpoyraz/wav2vec2-xls-r-300m-cv7-turkish",
"da": "saattrupdan/wav2vec2-xls-r-300m-ftspeech",
"he": "imvladikon/wav2vec2-xls-r-300m-hebrew",
"vi": "nguyenvulebinh/wav2vec2-base-vi",
"ko": "kresnik/wav2vec2-large-xlsr-korean",
"ur": "kingabzpro/wav2vec2-large-xls-r-300m-Urdu",
"te": "anuragshas/wav2vec2-large-xlsr-53-telugu",
"hi": "theainerd/Wav2Vec2-large-xlsr-hindi",
"ca": "softcatala/wav2vec2-large-xlsr-catala",
"ml": "gvs/wav2vec2-large-xlsr-malayalam",
"uz": "rifkat/wav2vec2-large-xls-r-300m-uz",
"ro": "anton-l/wav2vec2-large-xlsr-53-romanian",
"sv": "KBLab/wav2vec2-large-voxrex-swedish"
}
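For reference, the error text suggests a lookup roughly like the one below. This is only a sketch under assumptions: resolve_align_model is a hypothetical helper, the table is cut down to two entries, and the real load_align_model in whisperx may be structured differently.

```python
from typing import Optional

# Shortened copy of the table above; only two entries kept for the example.
DEFAULT_ALIGN_MODELS_HF = {
    "ro": "anton-l/wav2vec2-large-xlsr-53-romanian",
    "sv": "KBLab/wav2vec2-large-voxrex-swedish",
}


def resolve_align_model(language_code: str, model_name: Optional[str] = None) -> str:
    """Hypothetical helper: pick an alignment model name for a language.

    An explicitly passed model name wins; otherwise fall back to the
    default table and raise the same kind of error as in the traceback.
    """
    if model_name is not None:
        return model_name
    try:
        return DEFAULT_ALIGN_MODELS_HF[language_code]
    except KeyError:
        raise ValueError(
            f"No default align-model for language: {language_code}"
        ) from None


if __name__ == "__main__":
    print(resolve_align_model("sv"))
    try:
        resolve_align_model("yo")
    except ValueError as err:
        print(err)
```

If the lookup really works like this, a table that contains "sv" should not raise for Swedish, so it may be worth checking that the edited file is the one the application actually loads.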
Okay, it seems I'll have to take a deeper look into this. I'll keep the issue open until I find a way to solve it.
Great, thanks!
Hi. I also got the same error for the Yoruba language:
No default align-model for language: yo
What's the temporary fix for that?
Steps to reproduce
Windows.
Downloaded the latest release, already have ffmpeg installed.
Transcription language: Swedish
Audio source: file (file.mkv)
Transcription method: WhisperX
Output filetype: srt
Clicked on "Generate transcription"
Took around an hour, then I got:
An .srt file was created, and looking at the result (here are the first 11 lines):
It seems to do a decent job, but it can't split the dialogue correctly.
Perhaps that's because there is no align model?