Closed ZhaZhaFon closed 1 year ago
Hi, thank you for the interest. The VAD labels in here should correspond to the last row in Table 1. They are from the system we submitted so there is some error with respect to the oracle ones. You can use the oracle ones if you want, in the main script you only need to switch the VAD path and the rest of the recipe should work (it will, of course, produce different results since there will be no VAD error).
Hi, thank you for the interest. The VAD labels in here should correspond to the last row in Table 1. They are from the system we submitted so there is some error with respect to the oracle ones. You can use the oracle ones if you want, in the main script you only need to switch the VAD path and the rest of the recipe should work (it will, of course, produce different results since there will be no VAD error).
Hi, thank you for the interest. The VAD labels in here should correspond to the last row in Table 1. They are from the system we submitted so there is some error with respect to the oracle ones. You can use the oracle ones if you want, in the main script you only need to switch the VAD path and the rest of the recipe should work (it will, of course, produce different results since there will be no VAD error).
Now I see. Thanks.
By the way, what should I do if I want to switch to a speaker encoder trained by myself, e.g. the SOTA ECAPA-TDNN. I notice that in vbhmm.py, pre-computed files for PLDA are required. How can I produce this files for the speaker encoder trained on my own ? Are there any off-the-shelf tools ? ( I am very unfamiliar with PLDA and scoring back-ends...)
Thanks
Closing due to inactivity. Feel free to reopen if you see fit.
Hi,
Nice job ! Thanks for sharing.
I am trying to use your code to run VoxConverse. I tried to use ground-truth VAD results for clustering, but I found that VAD results in VBx/VAD/final_system is not consistent with the officially released ones. For example, an official VAD result (here) has more segments than yours (here)
How can I solve this ? Thanks.