pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
Hello!
The image here shows the procedure to follow when using fMLLR features. If I want to use FBANK features instead, for example, do I follow the same steps and then change the paths in the cfg file to point to the FBANK features? Should the alignments be computed with fMLLR or with FBANK features, and what is the impact in each case? You also align the training data with "steps/align_fmllr.sh --nj 30 data/train_clean_100 data/lang exp/tri4b exp/tri4b_ali_clean_100", but those alignments are never used in the cfg file, so why compute them?
Thanks a lot, Carlos