Closed SweetFov closed 4 years ago
Hi @SweetFov, thanks for your interest in NeMo and QuartzNet.
max_duration: 16.7
in YAML file).normalize_transcripts: False
set in YAML file?It seems like you use a different vocab.txt...Could you try use the original vocab.txt and see if the loss will decrease? If you want to keep your own vocabulary file, you may just refer to point 5 mentioned by @vsl9.
closing as this is related to the old version
same with the current version of nemo.
I fixed the encoder and only finetune the decoder, with the new vocabulary set.
Why can't we use new vocaburay set?
Hi, appreciate this great framework and your great work! Your pretrained Mandarin quartznet has very good performance on Aishell Testset, so I want to train the same model arch on our own Mandarin reading-style data from scratch; The train script is like this:
python -m torch.distributed.launch --nproc_per_node=2 ./jasper_aishell.py --batch_size=8 --num_epochs=150 --lr=0.00005 --warmup_steps=1000 --weight_decay=0.00001 --train_dataset=./word_4000h/lists/train.json --eval_datasets ./word_4000h/lists/dev_small.json --model_config=./aishell2_quartznet15x5/quartznet15x5.yaml --exp_name=quartznet_train --vocab_file=./word_4000h/am/token_dev_train_4400.txt --checkpoint_dir=$checkpoint_dir --work_dir=$checkpoint_dir
The training data is about 500 hours long. At first, the prediction is pretty much random;Then after several thousand iterations(before warmup ends), the predicitions stays as BLANK sequences for two epochs like this: Step: 46502020-01-07 09:53:20,694 - INFO - Loss: 110.91824340820312 2020-01-07 09:53:20,694 - INFO - training_batch_CER: 100.00% 2020-01-07 09:53:20,694 - INFO - Prediction: 2020-01-07 09:53:20,694 - INFO - Reference: 提起华华家的事情村民们声声长叹 Step time: 0.39273500442504883 seconds
I have tried the learning rate from 0.1 to 0.00005, warmpup steps from 1000 to 8000, batch size as 4,8,16,32, weight_decay from 0.001 to 0.00001, and none of those combinations could solve this problem. Have you ever encountered this kind of problem?