saharmor / whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
MIT License
777 stars 140 forks source link

Code switching inputs #19

Closed jayakrishnanmm closed 1 year ago

jayakrishnanmm commented 1 year ago

import whisper model = whisper.load_model("large-v2") result = model.transcribe('test_file/EN-ZH.wav') print(result['text']) The above audio contains a mix of English and Chinese language. But the result is just pure English only. How did you get code-switch output ? The same file played back in the demo gives the expected output. Pls tell me how it is done

saharmor commented 1 year ago

Are you sure you're not using the English only Whisper models? https://github.com/openai/whisper#available-models-and-languages