xiaozhi2015 closed this issue 4 years ago
@taylorlu If I build a library of target speakers, can I match the output labels against my library?
No. The output speaker indices start from zero, and the diarization cannot recognize the original speaker, because the embeddings of each sliding window are more continuous, rather than discrete like those of the training speakers.
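To illustrate the first point, here is a minimal sketch of why a zero-based output index carries no identity. It is not this repository's code: `relabel_by_first_appearance` is a hypothetical helper, and the cluster ids are made-up values standing in for whatever a clustering step might return.

```python
def relabel_by_first_appearance(cluster_ids):
    """Map arbitrary cluster ids to 0, 1, 2, ... in order of first appearance.

    Hypothetical helper for illustration only; the input ids are assumed
    to come from some clustering of sliding-window embeddings.
    """
    mapping = {}
    relabeled = []
    for cid in cluster_ids:
        if cid not in mapping:
            # The next unused index is assigned the first time an id is seen.
            mapping[cid] = len(mapping)
        relabeled.append(mapping[cid])
    return relabeled


# Two recordings with the same two (made-up) clusters, speaking in a different order.
recording_a = [7, 7, 3, 7, 3]
recording_b = [3, 3, 7, 3, 7]

print(relabel_by_first_appearance(recording_a))  # [0, 0, 1, 0, 1]
print(relabel_by_first_appearance(recording_b))  # [0, 0, 1, 0, 1]
```

Both recordings produce the same label sequence even though label 0 refers to a different cluster in each, so an output index only says "the first speaker heard in this file", not who that speaker is.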
OK, thx!
Could every output label be matched to the target speakers of openSLR?