kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.
http://kaldi-asr.org

Error when executing egs/yesno/s5/run.sh #4860

Open zhanyu1 opened 1 year ago

zhanyu1 commented 1 year ago

Preparing train and test data
Dictionary preparation succeeded
utils/prepare_lang.sh --position-dependent-phones false data/local/dict data/local/lang data/lang
Checking data/local/dict/silence_phones.txt ...
--> reading data/local/dict/silence_phones.txt
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/local/dict/silence_phones.txt is OK

Checking data/local/dict/optional_silence.txt ...
--> reading data/local/dict/optional_silence.txt
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/local/dict/optional_silence.txt is OK

Checking data/local/dict/nonsilence_phones.txt ...
--> reading data/local/dict/nonsilence_phones.txt
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/local/dict/nonsilence_phones.txt is OK

Checking disjoint: silence_phones.txt, nonsilence_phones.txt
--> disjoint property is OK.

Checking data/local/dict/lexicon.txt
--> reading data/local/dict/lexicon.txt
--> text seems to be UTF-8 or ASCII, checking whitespaces
--> text contains only allowed whitespaces
--> data/local/dict/lexicon.txt is OK

Checking data/local/dict/extra_questions.txt ...
--> data/local/dict/extra_questions.txt is empty (this is OK)
--> SUCCESS [validating dictionary directory data/local/dict]

*Creating data/local/dict/lexiconp.txt from data/local/dict/lexicon.txt
ERROR: FstHeader::Read: Bad FST header: standard input
Preparing language models for test
local/prepare_lm.sh: line 7: 11661 Segmentation fault arpa2fst --disambig-symbol=#0 --read-symbol-table=$test/words.txt input/task.arpabo $test/G.fst
local/prepare_lm.sh: line 7: 11662 Segmentation fault fstisstochastic $test/G.fst
ERROR: ReadFstClass: Can't open file: data/lang_test_tg/G.fst
ERROR: FstHeader::Read: Bad FST header: tmpdir.g/empty_words.fst
Succeeded in formatting data.
steps/make_mfcc.sh --nj 1 data/train_yesno exp/make_mfcc/train_yesno mfcc
utils/validate_data_dir.sh: WARNING: you have only one speaker. This probably a bad idea. Search for the word 'bold' in http://kaldi-asr.org/doc/data_prep.html for more information.
utils/validate_data_dir.sh: Successfully validated data-directory data/train_yesno
steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.
steps/make_mfcc.sh: Succeeded creating MFCC features for train_yesno
steps/compute_cmvn_stats.sh data/train_yesno exp/make_mfcc/train_yesno mfcc
Succeeded creating CMVN stats for train_yesno
fix_data_dir.sh: kept all 31 utterances.
fix_data_dir.sh: old files are kept in data/train_yesno/.backup
steps/make_mfcc.sh --nj 1 data/test_yesno exp/make_mfcc/test_yesno mfcc
utils/validate_data_dir.sh: WARNING: you have only one speaker. This probably a bad idea. Search for the word 'bold' in http://kaldi-asr.org/doc/data_prep.html for more information.
utils/validate_data_dir.sh: Successfully validated data-directory data/test_yesno
steps/make_mfcc.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.
steps/make_mfcc.sh: It seems not all of the feature files were successfully procesed (29 != 31); consider using utils/fix_data_dir.sh data/test_yesno
steps/make_mfcc.sh: Less than 95% the features were successfully generated. Probably a serious error.
steps/compute_cmvn_stats.sh data/test_yesno exp/make_mfcc/test_yesno mfcc
Succeeded creating CMVN stats for test_yesno
fix_data_dir.sh: kept 29 utterances out of 31
fix_data_dir.sh: old files are kept in data/test_yesno/.backup
steps/train_mono.sh --nj 1 --cmd utils/run.pl --totgauss 400 data/train_yesno data/lang exp/mono0a
steps/train_mono.sh: Initializing monophone system.
bash: line 1: 12181 Segmentation fault ( gmm-init-mono --shared-phones=data/lang/phones/sets.int "--train-feats=ark,s,cs:apply-cmvn --utt2spk=ark:data/train_yesno/split1/1/utt2spk scp:data/train_yesno/split1/1/cmvn.scp scp:data/train_yesno/split1/1/feats.scp ark:- | add-deltas ark:- ark:- | subset-feats --n=10 ark:- ark:-|" data/lang/topo 39 exp/mono0a/0.mdl exp/mono0a/tree ) 2>> exp/mono0a/log/init.log >> exp/mono0a/log/init.log
run.pl: job failed, log is in exp/mono0a/log/init.log
mkgraph.sh: expected data/lang_test_tg/G.fst to exist
steps/decode.sh --nj 1 --cmd utils/run.pl exp/mono0a/graph_tgpr data/test_yesno exp/mono0a/decode_test_yesno
steps/decode.sh: Error: no such file exp/mono0a/final.mdl
grep: exp/mono0a/decode_testyesno/wer: No such file or directory

I have searched through a lot of material but still cannot solve this. Any help with this problem would be appreciated.
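
Because run.sh keeps going after a stage fails, several of the later messages in the log above (the missing data/lang_test_tg/G.fst, the missing exp/mono0a/final.mdl) follow from earlier failures, so the earliest failure lines are probably the ones worth investigating first. As a rough triage aid, here is a minimal Python sketch that lists failure lines in the order they appear. The patterns and the function names `find_failures` and `first_failure` are illustrative assumptions, not part of Kaldi, and the script only surfaces candidate lines; it does not diagnose the cause.

```python
import re

# Assumed, non-exhaustive patterns for failure lines in console output
# like the log in this issue. Adjust them for other logs.
FAILURE_PATTERNS = [
    re.compile(r"Segmentation fault"),
    re.compile(r"^ERROR\b"),
    re.compile(r"job failed"),
    re.compile(r"no such file", re.IGNORECASE),
]


def find_failures(log_text):
    """Return (line_number, line) pairs for lines matching any failure pattern."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        stripped = line.strip()
        if any(pattern.search(stripped) for pattern in FAILURE_PATTERNS):
            hits.append((number, stripped))
    return hits


def first_failure(log_text):
    """Return the earliest failure line as (line_number, line), or None."""
    hits = find_failures(log_text)
    return hits[0] if hits else None


if __name__ == "__main__":
    # A short excerpt taken from the log in this issue.
    sample = "\n".join([
        "Dictionary preparation succeeded",
        "ERROR: FstHeader::Read: Bad FST header: standard input",
        "Preparing language models for test",
        "local/prepare_lm.sh: line 7: 11662 Segmentation fault fstisstochastic $test/G.fst",
        "Succeeded in formatting data.",
    ])
    for number, line in find_failures(sample):
        print(f"{number}: {line}")
```

Run against the full log, the earliest hit is the `Bad FST header: standard input` error during language preparation, which comes before the first segmentation fault.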

stale[bot] commented 11 months ago

This issue has been automatically marked as stale by a bot solely because it has not had recent activity. Please add any comment (simply 'ping' is enough) to prevent the issue from being closed for 60 more days if you believe it should be kept open.