large audio file language processing

zkmkarlsruhe / language-identification

Spoken Language Identification on Common Voice and AudioSet using Deep Learning

large audio file language processing #5

Open MostafaAlaviyan opened 1 year ago

MostafaAlaviyan commented 1 year ago

Hi, first of all, thanks for the valuable repo. I have some audio files, about 15 minutes long on average, in which several people speak different languages. How can I use your pretrained model to handle the aforementioned audio file? Best regards @bytosaur @danomatika @loelkes

danomatika commented 1 year ago

Howdy Mostafa,

How can I use your pretrained model to handle the aforementioned audio file?

Good question. The readme says: "All models expect 5 seconds of normalized audio sampled at 16kHz...", so audio in that form can be fed in to see what you get out. What is missing from the readme is how to load a file. @bytosaur I assume this is possible but just not documented?
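For a 15-minute recording, the readme's requirement suggests cutting the audio into 5-second pieces first. Below is a minimal sketch of that step only, not the repo's own loading code. The function name `split_into_chunks` is hypothetical, "normalized" is assumed to mean per-chunk peak normalization to [-1, 1], and zero-padding the last partial chunk is also an assumption. Loading the file is left to the reader, for example with `librosa.load(path, sr=16000, mono=True)`.

```python
import numpy as np

SAMPLE_RATE = 16_000  # from the readme: audio sampled at 16 kHz
CHUNK_SECONDS = 5     # from the readme: models expect 5 seconds of audio


def split_into_chunks(waveform, sample_rate=SAMPLE_RATE, seconds=CHUNK_SECONDS):
    """Split a mono waveform into consecutive fixed-length, peak-normalized chunks.

    Hypothetical helper, not part of the repo. A trailing remainder shorter
    than one chunk is zero-padded to full length.
    """
    waveform = np.asarray(waveform, dtype=np.float32)
    chunk_len = sample_rate * seconds
    n_chunks = int(np.ceil(len(waveform) / chunk_len))

    # Zero-pad so the signal divides evenly into chunks.
    padded = np.zeros(n_chunks * chunk_len, dtype=np.float32)
    padded[: len(waveform)] = waveform
    chunks = padded.reshape(n_chunks, chunk_len)

    # Assumption: "normalized" means peak normalization to [-1, 1] per chunk.
    peaks = np.max(np.abs(chunks), axis=1, keepdims=True)
    peaks[peaks == 0] = 1.0  # leave silent chunks untouched
    return chunks / peaks


# Synthetic 12-second signal standing in for a real recording.
demo = 0.5 * np.sin(2 * np.pi * 440 * np.arange(12 * SAMPLE_RATE) / SAMPLE_RATE)
chunks = split_into_chunks(demo)
print(chunks.shape)  # (3, 80000)
```

Each row of `chunks` could then be passed to the pretrained model in whatever input shape it expects, giving one language prediction per 5-second window. With several speakers in one file, the per-window predictions would show roughly where the language changes.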