Closed parthplc closed 3 years ago
Hi @parthplc, my quick answer is language independent. But let me clarify some more. This research is for general purpose audio representation, not for speech recognition. So our encoder network is trained without putting an assumption of the type of the sound. I hope this answers your question.
Hi, I guess the question might be fulfilled, so let me close for now. Please feel free to reopen whenever you need.
Can we create vector representation using a pretrained model only for English or is it language Independent?