lipiji / SongNet

Code for ACL 2020 paper "Rigid Formats Controlled Text Generation":https://www.aclweb.org/anthology/2020.acl-main.68/
MIT License
230 stars 40 forks source link

预训练模型的vocab.txt提供错误? #32

Closed hu0810 closed 1 year ago

hu0810 commented 1 year ago

想要用pre-trained模型重新训练时,出现: RuntimeError: Error(s) in loading state_dict for BIGLM: size mismatch for tok_embed.weight: copying a param with shape torch.Size([28781, 768]) from checkpoint, the shape in current model is torch.Size([6410, 768]). 的错误,个人推测似乎是vocab.txt上传到songci的版本,想请问怎么处理?

hu0810 commented 1 year ago

後來發現是要替換./data中的vocab.txt