jerryji1993 / DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome
https://doi.org/10.1093/bioinformatics/btab083
Apache License 2.0

Error when loading pretrained model for fine-tuning from a checkpoint of the pretrained model #65


danarte commented 2 years ago

Hi, very simple issue. The following error:

`ValueError: loaded state dict contains a parameter group that doesn't match the size of optimizer's group`

is raised when I try to load a pre-trained model for fine-tuning from a checkpoint folder located inside the pre-trained model's output folder. When the model is loaded from the "root" folder of the pre-trained model (the one that contains the checkpoint subfolders), fine-tuning runs fine. The error is thrown before training starts.

To reproduce, simply follow the steps in the example in README.md, including the pre-training (just set a lower number of epochs), and then for the fine-tuning at step 3.3 set the model path to one of the checkpoint folders. For example, if the pre-training output folder was set with `export OUTPUT_PATH=output$KMER`, then for fine-tuning set `export MODEL_PATH=output$KMER/checkpoint-1800/`, as in the commands below.
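For reference, here are the relevant export commands side by side (I'm assuming `KMER=6` as in the README 6-mer example; `checkpoint-1800` is just one of the checkpoints from my shortened pre-training run, any checkpoint subfolder reproduces the error):

```bash
# Assumed k-mer size, as in the README example
export KMER=6

# Pre-training output folder
export OUTPUT_PATH=output$KMER

# Fine-tuning (README step 3.3) works when MODEL_PATH is the root output folder:
export MODEL_PATH=output$KMER

# ...but raises the ValueError when MODEL_PATH is a checkpoint subfolder:
export MODEL_PATH=output$KMER/checkpoint-1800/
```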

dominiclopez391 commented 10 months ago

Hello,

I'm having this same issue. I think there's a problem fine-tuning DNABERT from a checkpoint rather than from a completed training run. Have you found a solution to this? What do you mean by loading the model from the "root" folder of the pre-trained model? Are you referring to the provided sample pre-trained models?

Update:

Delete optimizer.pt, scheduler.pt, and training_args.bin from the checkpoint folder to fine-tune from a checkpoint; see the commands below.
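A minimal sketch of the workaround, assuming the paths from the reproduction steps above (the optimizer/scheduler state is presumably only needed when resuming the original pre-training run, not when starting a new fine-tuning run):

```bash
# Checkpoint produced during pre-training (path from the example above)
export MODEL_PATH=output$KMER/checkpoint-1800/

# Remove the training-state files so that only the model weights
# (pytorch_model.bin) and config.json are picked up for fine-tuning
rm ${MODEL_PATH}optimizer.pt ${MODEL_PATH}scheduler.pt ${MODEL_PATH}training_args.bin

# Then run the README step 3.3 fine-tuning command with this MODEL_PATH
```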