The output should be in 'th' language, instead the output is mostly 'en'.
..., {'text': ' When you try, you will find a gap in between. When you miss, you will sit on the hard floor. It makes you want to get up and fight. But when you fight and you start to get something, you will find a gap. This gap is a trap. Some people are good at it, but they miss it. Some people are good at it, but they miss it. Some people are good at it, but they miss it. Some people are good at it, but they miss it. Some people are good at it, but they miss it.', 'start': 1120.026, 'end': 1140.435}, {'text': ' ', 'start': 1140.435, 'end': 1159.053}, {'text': ' ', 'start': 1159.053, 'end': 1185.282}, {'text': ' The best in Thailand, the first in the Olympic life, has been reading books all the time. He has won the Olympic gold medal.', 'start': 1185.282, 'end': 1203.08}], 'language': 'th'}
In the wav file, he's speaking in 'th', but somehow the transcription is the translation of his speech.
I tried to use fine-tuned model with whisperx, so i first convert the model using this code.
then run transcribe
The output should be in 'th' language, instead the output is mostly 'en'.
In the wav file, he's speaking in 'th', but somehow the transcription is the translation of his speech.
Anything to fix this? Thank you in advance.