Closed lhanzl closed 2 years ago
Hi, sorry about the reaction time. Does it hang on the training examples or on the validation?
Closed due to inactivity.
Hi @Jamiroquai88, I also meet the problem. It has hanged on the training examples for a whole day, and log file exp/egs/log/allocate_examples_train_subset.log
looks like this:
sid/nnet3/xvector/allocate_egs_but.py --prefix train_subset --num-repeats=3 --frames-per-chunk=400 --num-pdfs=7323 --num-jobs=1 --num-archives=1 --utt2len-filename=exp/egs/temp/utt2num_frames.train_subset --utt2int-filename=exp/egs/temp/utt2int.train_subset --egs-dir=exp/egs
Starting get_utt2len
Starting get_labels
Processing archive 1
Look for your kind help. Thanks!
Hi, if this only hangs on the train_subset
and not on the main training part, I would skip this step. It has been some time since this was implemented and I am not sure what is causing this issue.
more detailed steps by @MichalKlco: The training script doesn't use it. In the local/nnet3/xvector/get_egs_but.sh script, you have to comment all the parts related to train_subset in each stage, otherwise, it will fail on the way (stage 2-5). Stage 5 is little bit tricky, lines 217-222 seems like clearing some garbage (should be commented out if you skip valid/train_subset) and lines 224-227 should be commented out, too.
I get it. Thanks for your response!
I am reproducing the results of the experiment. The main script runs to stage=4
and is blocked for a long time as follows.
Can anyone tell me what the script is doing and how I can solver this problem. Thank you very much.