idiap / w2v2-air-traffic

MIT License
34 stars 6 forks source link

there were 0 empty or blank utterances #5

Open JiweiTian opened 10 months ago

JiweiTian commented 10 months ago

Please help fix the issue. rootatm@DESKTOP-SHTIP30:/mnt/i/ASR/bert-text-diarization-atc-main$ bash data/databases/uwb_atcc/data_prepare_uwb_atcc_corpus.sh there were 0 empty or blank utterances printing the text file in: experiments/data/uwb_atcc/prep/text2_raw_spk printing the text and tags file in: experiments/data/uwb_atcc/prep/utt2speakerid Traceback (most recent call last): File "/mnt/i/ASR/bert-text-diarization-atc-main/data/utils/gen_train_test.py", line 61, in main() File "/mnt/i/ASR/bert-text-diarization-atc-main/data/utils/gen_train_test.py", line 42, in main x_train, x_test = train_test_split( File "/home/rootatm/.local/lib/python3.10/site-packages/sklearn/utils/_param_validation.py", line 214, in wrapper return func(*args, **kwargs) File "/home/rootatm/.local/lib/python3.10/site-packages/sklearn/model_selection/_split.py", line 2649, in train_test_split n_train, n_test = _validate_shuffle_split( File "/home/rootatm/.local/lib/python3.10/site-packages/sklearn/model_selection/_split.py", line 2305, in _validate_shuffle_split raise ValueError( ValueError: With n_samples=0, test_size=None and train_size=0.8, the resulting train set will be empty. Adjust any of the aforementioned parameters.