PES2g closed this 5 years ago
Our method is consistent for both training and testing:
How do you know which speaker embeddings correspond to overlapped speakers, so that they can be removed?
@chienducnguyen It's from the ground truth. The ground truth has segments labelled with two speakers.
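For anyone following along, here is a minimal sketch of how that ground-truth filtering could look. It is not the authors' code. It assumes the ground truth is available as `(start_sec, end_sec, speaker)` tuples, that each embedding covers a known `(start_sec, end_sec)` window, and that one speaker's own segments never overlap each other. The names `overlap_regions` and `drop_overlapped` are made up for illustration.

```python
from typing import List, Tuple

Segment = Tuple[float, float, str]   # (start_sec, end_sec, speaker), assumed format
Interval = Tuple[float, float]       # (start_sec, end_sec)


def overlap_regions(segments: List[Segment]) -> List[Interval]:
    """Return time intervals where two or more ground-truth segments are active."""
    events = []
    for start, end, _speaker in segments:
        events.append((start, 1))    # a segment opens
        events.append((end, -1))     # a segment closes
    # At equal timestamps, process closings first so that segments which
    # merely touch are not counted as overlapping.
    events.sort(key=lambda e: (e[0], e[1]))

    regions: List[Interval] = []
    active = 0
    overlap_start = 0.0
    for time, delta in events:
        previous = active
        active += delta
        if previous < 2 <= active:
            overlap_start = time                  # overlap begins
        elif previous >= 2 > active and time > overlap_start:
            regions.append((overlap_start, time)) # overlap ends
    return regions


def drop_overlapped(windows: List[Interval], segments: List[Segment]) -> List[int]:
    """Return indices of embedding windows that touch no overlap region."""
    regions = overlap_regions(segments)
    keep = []
    for index, (w_start, w_end) in enumerate(windows):
        hits_overlap = any(w_start < r_end and r_start < w_end
                           for r_start, r_end in regions)
        if not hits_overlap:
            keep.append(index)
    return keep


if __name__ == "__main__":
    truth = [(0.0, 5.0, "A"), (4.0, 8.0, "B"), (8.0, 10.0, "A")]
    windows = [(0.0, 2.0), (3.5, 4.5), (4.5, 5.5), (6.0, 8.0)]
    print(overlap_regions(truth))
    print(drop_overlapped(windows, truth))
```

The embeddings whose indices are not returned by `drop_overlapped` would be the ones to discard. Whether a window that only partially intersects an overlap region should be dropped is a design choice; this sketch drops it.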
In your paper, during evaluation, you exclude overlapped speech. Which one below is the solution?
And during training, which one is the solution?