Does VBx ever use the training partition in AMI-diarization-setup?

BUTSpeechFIT / VBx

Variational Bayes HMM over x-vectors diarization

252 stars 57 forks source link

Does VBx ever use the training partition in AMI-diarization-setup? #66

Closed alephpi closed 7 months ago

alephpi commented 7 months ago

When consulting https://github.com/BUTSpeechFIT/AMI-diarization-setup.git you proposed, I find it split into dev,test,train and I think dev is development, test is evaluation in your technical report, but what about the train? Have you ever used it in VBx (for xvector extractor training or inference)? I think no. Then what's the purpose of such a partition?

alephpi commented 7 months ago

Also I find that the VAD label in example/vad/ES2005a.lab is neither same as AMI-diarization-setup/only_words/labs/train/ES2005a.lab nor asAMI-diarization-setup/word_and_vocalsounds/labs/train/ES2005a.lab Could you explain? https://github.com/BUTSpeechFIT/VBx/blob/master/example/vad/ES2005a.lab https://github.com/BUTSpeechFIT/AMI-diarization-setup/blob/b158cbecae7c91298346c6bec1d6dcad473f61d3/only_words/labs/train/ES2005a.lab https://github.com/BUTSpeechFIT/AMI-diarization-setup/blob/b158cbecae7c91298346c6bec1d6dcad473f61d3/word_and_vocalsounds/labs/train/ES2005a.lab

fnlandini commented 7 months ago

Hi, We have not used the train set in this work. The split follows the Full-corpus-ASR partition of AMI https://groups.inf.ed.ac.uk/ami/corpus/datasets.shtml This dataset precedes our VBx work and we wanted to be sure that there was consistency in the labels for all parts.

As for example/vad/ES2005a.lab, it was probably generated with a system VAD and that is why it does not match the oracle VAD from AMI-diarization-setup. In any case, it is only used as an example to validate that the code runs.

alephpi commented 7 months ago

ok I see, thank you!