Closed TeaPoly closed 4 years ago
It does seem like a problem in beam search stage. @cookingbear please look into this
Here is log: avg_acc_zero.log
can you share your config?
can you share your config?
can you share your config?
have you ever tested the result of the dev dataset in decoding step and does it have the same problem with that of the test dataset?
can you share your config?
have you ever tested the result of the dev dataset in decoding step and does it have the same problem with that of the test dataset?
I have tested the dev dataset, the results in decoding step looks fine.
can you share your config?
have you ever tested the result of the dev dataset in decoding step and does it have the same problem with that of the test dataset?
Overfiting or CMVN issue?
can you share your config?
have you ever tested the result of the dev dataset in decoding step and does it have the same problem with that of the test dataset?
Overfiting or CMVN issue?
The CMVN of the test dataset is missing. If you wanna run the script from the beginning, add the line below under the "# cal cmvn" line: tail -n +2 examples/asr/aishell/data/test.csv >> examples/asr/aishell/data/all.csv If you wanna run the decode_main.py, you can remove the examples/asr/aishell/data/cmvn, the system will automatically create the CMVN of the decoding dataset.
can you share your config?
have you ever tested the result of the dev dataset in decoding step and does it have the same problem with that of the test dataset?
Overfiting or CMVN issue?
The CMVN of the test dataset is missing. If you wanna run the script from the beginning, add the line below under the "# cal cmvn" line: tail -n +2 examples/asr/aishell/data/test.csv >> examples/asr/aishell/data/all.csv If you wanna run the decode_main.py, you can remove the examples/asr/aishell/data/cmvn, the system will automatically create the CMVN of the decoding dataset.
Get it. Thanks so much.
Thanks for your contribution firstly. It's hard to imagine that you not only finished the paper 'IMPROVING TRANSFORMER-BASED SPEECH RECOGNITION USING UNSUPERVISED PRE-TRAINING' but also shared the code. Thank you very much.
There is a problem when I using athena project. It looks fine in "Preparing data"、"Pretraining"、"Fine-tuning" step, develop dataset has high accuracy. But when I use athena/decode_main.py with test dataset in "Decoding" step, the "avg_acc" is always zero in log messages, like this:
INFO:absl:predictions: tf.Tensor([[4233]], shape=(1, 1), dtype=int64) labels: [[ 424 2477 3491 1238 850 1284 1269]] errs: 7 avg_acc: 0.0000 sec/iter: 0.3383
My script file is modified from "hkust" exmaple, here is my script:
I'm not sure where is the problem.