I understand the general purpose of these two files, but I’m still a bit uncertain about how to generate them correctly. For instance, when creating an utt2spk file, does uttid correspond to the filename? And what should the corresponding spkid be? For different speakers, would it be appropriate to use identifiers like spk00, spk01, and spk02? For the second utterance, can I continue using spk00, spk01, and spk02, or would it be better to use new identifiers, starting from spk03?
Additionally, I have a question regarding the spk2utt file. Should speaker IDs be unique within each utterance, or do they need to be unique across the entire dataset?
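To make the question concrete, here is a minimal sketch of my current understanding. It assumes the Kaldi-style layout, where each utt2spk line is "<uttid> <spkid>" and spk2utt is simply the inverse mapping with one line per speaker. The IDs and the helper name invert_utt2spk are made up for illustration, and reusing the same spkid whenever the same person speaks is my assumption, not something I have confirmed.

```python
from collections import defaultdict


def invert_utt2spk(utt2spk_lines):
    """Build spk2utt lines from utt2spk lines ('<uttid> <spkid>' per line)."""
    spk2utt = defaultdict(list)
    for line in utt2spk_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        uttid, spkid = line.split(maxsplit=1)
        spk2utt[spkid].append(uttid)
    # One line per speaker: '<spkid> <uttid1> <uttid2> ...', sorted for stable output.
    return [f"{spk} {' '.join(sorted(utts))}" for spk, utts in sorted(spk2utt.items())]


# Hypothetical IDs: two recordings; spk00 appears in both (assumed to be the same person).
utt2spk = [
    "spk00-rec1 spk00",
    "spk01-rec1 spk01",
    "spk00-rec2 spk00",
]
spk2utt = invert_utt2spk(utt2spk)
for entry in spk2utt:
    print(entry)
```

If this picture is right, a spkid would have to mean the same person across the whole dataset, since spk2utt groups every utterance under that one ID.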
Thank you very much for your time and assistance—it’s greatly appreciated!