Snowdar / asv-subtools

An Open Source Tools for Speaker Recognition
Apache License 2.0
590 stars 135 forks source link

How to prepare utt2spk utt2lang trails files for asv-subtools/recipe/ap-olr2020-baseline? #14

Closed ASR2020Guru closed 3 years ago

ASR2020Guru commented 3 years ago

Hi @Snowdar ,

The recipe asv-subtools/recipe/ap-olr2020-baseline is designed for language recognition tasks. So at the data preparation stage, should I put the language label in the utt2spk file or in the utt2lang file?

I am new to language recognition, so I am litter confused about the above codes.

So what file I should use as input for subtools/getTrials.sh to generate the trials file?

Thanks

Snowdar commented 3 years ago

Hi, utt2spk is used and it means that language ids are seen as "speaker" ids in this system. Then, utt2spk and spk2utt are also used for subtools/getTrials.sh. For more details, just see the usages of this script.