nusnlp / mlconvgec2018

Code and model files for the paper: "A Multilayer Convolutional Encoder-Decoder Neural Network for Grammatical Error Correction" (AAAI-18).
GNU General Public License v3.0
185 stars 73 forks source link

Error loading state_size in FConvModel : size mismatch in encoder-decoder weights #33

Open NikhilCherian opened 4 years ago

NikhilCherian commented 4 years ago

@shamilcm @gurunath-p

Hello there again. I was actually trying something new with the mlconvgec models where I trained them with latest fairseq model with different bpe bert and tokenizer moses.

Training was all fine. But there seems to error with the generate with test data jfleg for getting the GLEU scores.

python generate.py test/jfleg --path checkpoints/lang8-nucle-bert-moses/checkpoint_best.pt --batch-size 128 --beam 5 --nbest 12 --lang-model-data data-bin/wiki103 --lang-model-path data-bin/wiki103/wiki103.pt --source-lang en --target-lang gec --bpe bert --tokenizer moses --dataset-impl raw

The error is with a problem in size mismatch in the layers. image

Any help would be highly appreciated. Thanks in advance.