VinAIResearch / PhoWhisper

PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
Apache License 2.0
99 stars 10 forks source link

Is it possible to train the model with multi lingual languages ? #4

Open leviethung2103 opened 6 months ago

leviethung2103 commented 6 months ago

Hi VinAI Team,

Given a audio that speaker mainly speaks 90% of time in Vietnamese, 10% of time in English. I've tested your model with this type of audio and English words are interpreted as Vietnamese language.

I am thinking of re-train the model with a dataset that contains the English and Vietnamese in transcript. Do you think this approach is feasible or not ?

Thank you

Fannovel16 commented 3 months ago

It's possible as the base Whisper is already a multilingual ASR model