Open coding-dallas opened 1 year ago
Hi, it seems you should use librispeech train-other-500h data for making LibriPhrase evalution set in principle. If you want to use librispeech test-clean, you need to reduce maxspk since the number of speakers from test-clean is smaller than train-other-500h.
i am not clear what the parameters ( --numpair, --maxsp) tells
of positive case rows = 0
of negative case rows = 0
Traceback (most recent call last): File "/asr3/kesav/keywrd_file_prep/LibriPhrase/LibriPhrase_test_clean/codes/libriphrase.py", line 110, in
main(args)
File "/asr3/kesav/keywrd_file_prep/LibriPhrase/LibriPhrase_test_clean/codes/libriphrase.py", line 95, in main
total_df = total_df.sort_values(by=['anchor_spk', 'anchor_text', 'target', 'type', 'comparison_spk'], ascending=[True, True, True, True, True])
File "/home/asr/.local/lib/python3.10/site-packages/pandas/core/frame.py", line 6740, in sort_values
keys = [self._get_label_or_level_values(x, axis=axis) for x in by]
File "/home/asr/.local/lib/python3.10/site-packages/pandas/core/frame.py", line 6740, in
keys = [self._get_label_or_level_values(x, axis=axis) for x in by]
File "/home/asr/.local/lib/python3.10/site-packages/pandas/core/generic.py", line 1778, in _get_label_or_level_values
raise KeyError(key)
KeyError: 'anchor_spk'
I am trying to prepare libriphrase dataset from librispeech test clean. for up to 2 word class, it is working fine. when extracting 3 word class, it is throwing the above error.
below are the parameter i passed for preparing the dataset.
python3 libriphrase.py --libripath '/asr3/kesav/keywrd_file_prep/LibriPhrase/LibriPhrase_test_clean/data/LibriSpeech_clean_wav/' --newpath '/asr3/kesav/keywrd_file_prep/LibriPhrase/LibriPhrase_test_clean/data/LibriPhrase_diffspk_all/' --wordalign '/asr3/kesav/keywrd_file_prep/LibriPhrase/LibriPhrase_test_clean/metadata/librispeech_clean_test_all_utt_with_flac.csv' --output '/asr3/kesav/keywrd_file_prep/LibriPhrase/LibriPhrase_test_clean/metadata/librispeech_clean_test_short_phrase.csv' --numpair 3 --maxspk 1611 --maxword 4 --mode 'diffspk_all'