kaldi-asr / kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.
http://kaldi-asr.org
Other
14.31k stars 5.32k forks source link

[Fisher] Issue on semisup/run_100k.sh #4658

Open JuanPZuluaga opened 3 years ago

JuanPZuluaga commented 3 years ago

Hi,

I've come across two problems in the https://github.com/kaldi-asr/kaldi/blob/master/egs/fisher_english/s5/local/semisup/run_100k.sh recipe.

  1. On stage 7: When running local/fisher_train_lms_pocolm.sh I get an error becuase the number o n-grams of the dataset (100k) is smaller than the number of n-grams to prune:
the num-ngrams(1544907) of input LM is less than the target-num-ngrams(5000000), can not do any pruning.
  1. On stage 10: the param --sup-lat-dir $exp_root/chain/tri4a_train_sup_unk_lats should be changed to --sup-lat-dir $exp_root/chain/tri4a_train_sup_sp_unk_lats which uses the sp version instead.

R, Juan Pablo

stale[bot] commented 2 years ago

This issue has been automatically marked as stale by a bot solely because it has not had recent activity. Please add any comment (simply 'ping' is enough) to prevent the issue from being closed for 60 more days if you believe it should be kept open.

kkm000 commented 2 years ago
  1. Does the first message cause any issues? Is not it just a warning?
  2. Do you want to send a PR for the second one?

Thanks!

stale[bot] commented 2 years ago

This issue has been automatically marked as stale by a bot solely because it has not had recent activity. Please add any comment (simply 'ping' is enough) to prevent the issue from being closed for 60 more days if you believe it should be kept open.