Speaker diarization for test wav files while using pre-trained model from callhome_v2

kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Other

14.27k stars 5.32k forks source link

Hi, I am still new to Kaldi. I would like to perform diarization on some of the speech samples from my own dataset which do not have any speaker labels available, so I would have to listen and compare it to what the diarization outputs. I have a questions on this:

a) Does it make sense to use a pre-trained model, such as the callhome_v2 model, as there maybe different recording conditions, dialect and possibly language? Or are we assuming that the pretrained model has learned generelizable features (xvectors) so to be able to work well even on an unseen dataset?

Thanks in advance

kaldi-asr / kaldi

Speaker diarization for test wav files while using pre-trained model from callhome_v2 #4400