mravanelli / pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

using different features instead of FMLLR #257

Open Miamoto opened 2 years ago

Miamoto commented 2 years ago

Hello!

The image below shows the procedure to follow when using FMLLR features. If I want to use FBANK features instead, do I follow the same steps and then change the paths in the cfg file so they point to the FBANK features? Should the alignments be computed with FMLLR or with FBANK features, and what is the impact in each case? You also align the training data with "steps/align_fmllr.sh --nj 30 data/train_clean_100 data/lang exp/tri4b exp/tri4b_ali_clean_100", but the resulting alignments are never used in the cfg file, so why run this alignment?

[Screenshot, 2022-03-08 15:22:09]

Thanks a lot, Carlos