Open snehas89 opened 5 months ago
It seems that when uploading an audio or video in Kannada, only the initial portion gets transcribed accurately, while the subsequent part is transcribed in Tamil, as depicted in the provided screenshot. This likely arises due to a language detection error or a system glitch.
can u pls tell me step by step how to run this project in my machine
@snehas89 can you give us more details about the error. I have also noticed this issue. It's not new to us to be honest.
But if you provide more details like:
@snehas89 it will be helpful for us. Also @snehas89 did you want to help us with issue #9 ?
@kurianbenoy
Yes I did use 3 of the models provided i.e, SeamlessM4T, Faster-Whisper, WhisperX
Out of the 3 models Faster-Whisper gave a result better than the other two. My primary aim was to transcribe the audio file and later look into translation, but was not able to proceed with it.
@ronald0098 I'm not sure if I found any documentation on how to run the model locally, I used the Indic subtitler web app https://indicsubtitler.in/ @kurianbenoy can confirm if this is right
Can you share the local audio file here if possible? @snehas89
We haven't added the documentation on how to run model locally, but yeah we can do that when we are free. Created an issue #13 for this.
@kurianbenoy doc.zip
Please find the attached zip file, as github doesn't support audio formats uploading
Thanks @snehas89 for sharing the files via zip files. We can't do much for the time being to be honest.
Yet in the future, we might work on improving accuracy with LLMs, so these multiple language outputs doesn't happen.
It seems that when uploading an audio or video in Kannada, only the initial portion gets transcribed accurately, while the subsequent part is transcribed in Tamil, as depicted in the provided screenshot. This likely arises due to a language detection error or a system glitch.