Open ionymikler opened 1 week ago
Hi, @ionymikler,
It might be the the way of loading the pretrained ckpt. The --model_name_or_path
parameter should point to the directory containing the pretrained model weights. This directory must include at least the following files:
config.json
: The model configuration file defining architecture parameters.preprocessor_config.json
: If special input preprocessing is required, this file might be necessary.pytorch_model.bin
: The pretrained model's weights file.Thus, please ensure that --model_name_or_path
points to the complete model folder downloaded from the Hugging Face repository, not to a single file path.
Hi @falcon-xu !
I am trying to run the
eval_highway_deit.sh
script using the model you published in this HF repo, but running into some issues.Originally, your
eval_highway_deit.sh
script has the following lines:I thought the
mode_name_or_path
should point to themodel.bin
that comes from the HF repo, but I get the following error (snippet here but uploaded a txt with all the error traceback):Looks like the it is asking for a config file, but I'm unsure what that means. I tried providing then the
config.json
from the same HF model repo, but also didn't work. This time it gave this error:I guess my question is really how to provide the script with the pretrained model?
Also, what are some scripts named 'highway', what is the meaning of it?
Thanks for your help!