-
vosk api is defautly support for decoding of TDNN nnet3.
I did some experiments, the TDNN-LSTM is obviously better than TDNN and the TDNN-LSTM get 3% gain comparing with TDNN.
The TDNN-LSTM needs t…
-
作者能否公布一下msra结果相关的训练参数?norm = 0/1/2/3 ? learning rate batchsize之类的。
-
Hello,
I'm trying to train a model using the Aspire recipe, using the latest code from the master branch, but am encountering the following error when running `local/chain/run_tdnn_lstm.sh`. Whe…
-
-
Hi, i ran into a problem when training a lstm-rnn acoustic model on TIMIT database. Here are parts of my code and the corresponding result after running.There is an error, e.g., LOG (apply-cmvn:main(…
-
Hi~
I have some questions about trainning process.
Does the model only process fixed batch_size and sequence_length? How to process dynamic batches?
When running the last batch (less than batch_s…
-
Hi,
I try do decode the output of an acoustic model (CTC) built in pytorch with the eesen framework. On WSJ, I achieve good results with the `decode-faster` function as in [decode_ctc.sh](https://g…
-
Hi all,
I've seen the new tf_clean branch is available, so be trying to use it. I'm using ```swbd/v1-tf``` recipe and was able to train it successfully. However, I cannot find any script for single w…
-
such as word/phone start time, end time, confidence, etc.
tz301 updated
2 years ago
-
Hi, I am newer to learn icefall,I finished the training of tdnn_lstm_ctc, when run the decode steps, I meet the following error, I change the --max-duration, there are still errors:
![image](https:…