neilctwu / YouyakuMan

Extractive summarizer using BertSum as summarization model

pretrained model size mismatch with the size of cl-tohoku/bert-base-japanese #10

Open Daichi-Kudo opened 3 years ago

Daichi-Kudo commented 3 years ago

I got the error below when running `python youyakuman.py -txt_file testjp.txt -lang jp -n 3 --super_long`:

```
RuntimeError: Error(s) in loading state_dict for ModelLoader:
    size mismatch for bert.model.embeddings.word_embeddings.weight: copying a param with shape torch.Size([32006, 768]) from checkpoint, the shape in current model is torch.Size([32000, 768]).
```

Packages: `torch==1.3.0`, `transformers==2.9.0`, `googletrans==2.4.0`, `pyknp==0.4.1`

I downloaded the pretrained model from the link in readme.md. Am I doing something wrong, or do I need to start collecting data and train a model myself?
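A quick way to see where the mismatch comes from is to compare the embedding row count stored in the checkpoint with the vocabulary size of the current model. The sketch below is hypothetical: the `state_dict` is simulated with a zero tensor instead of being read from the real checkpoint file, which would normally be loaded with `torch.load(...)`.

```python
import torch

# Simulated stand-in for the real checkpoint state_dict (assumption: the
# actual file would be loaded with torch.load on the downloaded checkpoint).
state_dict = {
    "bert.model.embeddings.word_embeddings.weight": torch.zeros(32006, 768)
}

ckpt_rows = state_dict["bert.model.embeddings.word_embeddings.weight"].shape[0]
model_rows = 32000  # vocab size of cl-tohoku/bert-base-japanese, per the error

# A nonzero difference means the checkpoint was trained against a tokenizer
# (dictionary) with a different vocabulary than the current model's.
print(ckpt_rows - model_rows)  # → 6 extra rows in the checkpoint
```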

neilctwu commented 3 years ago

Hi, I believe the error is due to a different version of the Juman dictionary; the difference between 32006 and 32000 looks like the tokenizer's vocabulary length. Unfortunately I didn't keep the original version of Juman, so I can't tell which version you should use. You would need to collect data and train again to reproduce the result.
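If retraining isn't an option, one possible (unverified) workaround is to pad the current model's embedding matrix to the checkpoint's row count before calling `load_state_dict`. Note this only makes the load succeed shape-wise: the six extra rows still correspond to tokens from whatever Juman dictionary the checkpoint was trained with, so output quality is not guaranteed. A minimal sketch with plain `torch.nn` (the sizes are taken from the error message; nothing here is the project's actual loading code):

```python
import torch
import torch.nn as nn

# Current model's embedding: 32000 rows (cl-tohoku/bert-base-japanese vocab).
emb = nn.Embedding(32000, 768)

# Pad with zero rows up to the checkpoint's 32006 rows so the shapes match.
extra = torch.zeros(32006 - 32000, 768)
new_weight = torch.cat([emb.weight.data, extra], dim=0)

# Rebuild the embedding layer from the padded matrix (trainable).
new_emb = nn.Embedding.from_pretrained(new_weight, freeze=False)
print(new_emb.weight.shape)  # → torch.Size([32006, 768])
```

With `transformers`, the equivalent one-liner on a loaded model would be `model.resize_token_embeddings(32006)`, but the caveat about mismatched token IDs applies either way.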