k2-fsa / snowfall

Moved to https://github.com/k2-fsa/icefall
Apache License 2.0

Support choosing unigram or bigram for P in LF-MMI training. #218

Open csukuangfj opened 3 years ago

csukuangfj commented 3 years ago

With unigram LM for P

export CUDA_VISIBLE_DEVICES="0"

./mmi_att_transformer_train.py \
  --master-port=12355 \
  --full-libri=0 \
  --use-ali-model=0 \
  --max-duration=500 \
  --use-unigram=1

./mmi_att_transformer_decode.py \
  --use-lm-rescoring=1 \
  --num-paths=100 \
  --max-duration=300 \
  --use-unigram=1

With bigram LM for P

export CUDA_VISIBLE_DEVICES="1"

./mmi_att_transformer_train.py \
  --master-port=12356 \
  --full-libri=0 \
  --use-ali-model=0 \
  --max-duration=500 \
  --use-unigram=0

./mmi_att_transformer_decode.py \
  --use-lm-rescoring=1 \
  --num-paths=100 \
  --max-duration=300 \
  --use-unigram=0
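The two configurations above differ only in the GPU, the master port, and the --use-unigram flag. A minimal dry-run wrapper that prints both training command lines instead of launching them (the loop itself is just a sketch; the script name and flags are taken verbatim from the commands above):

```shell
#!/usr/bin/env bash
# Dry run: print the training command for each choice of P
# (use-unigram=1, then use-unigram=0), bumping GPU and port each time.
port=12355
gpu=0
for use_unigram in 1 0; do
  echo "CUDA_VISIBLE_DEVICES=$gpu" \
       "./mmi_att_transformer_train.py" \
       "--master-port=$port" \
       "--full-libri=0" \
       "--use-ali-model=0" \
       "--max-duration=500" \
       "--use-unigram=$use_unigram"
  port=$((port + 1))
  gpu=$((gpu + 1))
done
```

Dropping the echo would run the two trainings sequentially; the issue above runs them in parallel on two GPUs instead.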

Will report the result when it is available (Probably tomorrow morning).

csukuangfj commented 3 years ago

The following shows what the unigram P and bigram P look like when there are only 3 phones: a, b, and c.

[figures: unigram P and bigram P over the phones a, b, c]
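To make the distinction concrete: a unigram P scores each phone independently, while a bigram P conditions each phone on the previous one. A quick maximum-likelihood sketch in plain Python, without k2 (the toy corpus over a, b, c and the function names are made up for illustration; snowfall builds P as an FSA rather than a dict):

```python
import math
from collections import Counter

def unigram_logprobs(corpus):
    """MLE unigram log-probabilities: log P(phone)."""
    counts = Counter(p for seq in corpus for p in seq)
    total = sum(counts.values())
    return {p: math.log(c / total) for p, c in counts.items()}

def bigram_logprobs(corpus):
    """MLE bigram log-probabilities: log P(phone | previous phone),
    with "<s>" as the sentence-start context."""
    pair_counts = Counter()
    ctx_counts = Counter()
    for seq in corpus:
        prev = "<s>"
        for p in seq:
            pair_counts[(prev, p)] += 1
            ctx_counts[prev] += 1
            prev = p
    return {k: math.log(v / ctx_counts[k[0]]) for k, v in pair_counts.items()}

# Toy corpus over phones a, b, c (made up for this sketch).
corpus = [["a", "b", "c"], ["a", "b", "b"], ["c", "a", "b"]]
uni = unigram_logprobs(corpus)
bi = bigram_logprobs(corpus)
```

In this toy corpus "b" always follows "a", so the bigram assigns it log-probability 0 in that context, while the unigram only sees its overall frequency; that extra context is what the bigram P contributes to the LF-MMI denominator graph.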

csukuangfj commented 3 years ago

Here are the results for using unigram P and bigram P.

They show that the WER with unigram P is worse than that with bigram P. Also, the objf of unigram P is slightly higher.

objf values

[figure: objf values for unigram P vs. bigram P]

WERs

[figures: WERs for unigram P vs. bigram P]
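The comparison above is in terms of WER. For reference, a minimal single-row Levenshtein WER computation (a standard sketch, not snowfall's scoring code):

```python
def wer(ref, hyp):
    """Word error rate: edit distance between word lists,
    normalized by the reference length."""
    r, h = ref.split(), hyp.split()
    # dp[j] holds the edit distance between the processed prefix of r
    # and h[:j]; updated row by row over r.
    dp = list(range(len(h) + 1))
    for i in range(1, len(r) + 1):
        prev = dp[0]          # value of dp[i-1][j-1]
        dp[0] = i
        for j in range(1, len(h) + 1):
            cur = dp[j]
            cost = 0 if r[i - 1] == h[j - 1] else 1
            dp[j] = min(dp[j] + 1,      # deletion of r[i-1]
                        dp[j - 1] + 1,  # insertion of h[j-1]
                        prev + cost)    # substitution or match
            prev = cur
    return dp[-1] / len(r)
```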