VinAIResearch / PhoWhisper

PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
BSD 3-Clause "New" or "Revised" License
113 stars 10 forks source link

Is it possible to train the model with multi lingual languages ? #4

Closed leviethung2103 closed 2 months ago

leviethung2103 commented 9 months ago

Hi VinAI Team,

Given a audio that speaker mainly speaks 90% of time in Vietnamese, 10% of time in English. I've tested your model with this type of audio and English words are interpreted as Vietnamese language.

I am thinking of re-train the model with a dataset that contains the English and Vietnamese in transcript. Do you think this approach is feasible or not ?

Thank you

Fannovel16 commented 6 months ago

It's possible as the base Whisper is already a multilingual ASR model