Closed Sara-Rajaee closed 1 month ago
Here it expects a DeepSpeed checkpoint instead of a Hugging Face checkpoint. One option would be training the model by yourself (python retrieval/main.py fit ...
). Another workaround is to convert the Hugging Face checkpoint to DeepSpeed by running the training script for 1 step with 0 learning rate (using something like retrieval/confs/cli_dummy.yaml).
Thanks for your answer, I'll close the issue
Hi, I want to run the experiments in "Retrieving Premises for All Proof States" using your HF checkpoints (without training the retriever). However, when I use the provided command, I get a deepspeed checkpoint error. I would appreciate it if you could help addressing this.
This is the command I use:
python retrieval/main.py predict --config retrieval/confs/cli_lean4_random.yaml --ckpt_path kaiyuy/leandojo-lean4-retriever-tacgen-byt5-small --trainer.logger.name predict_retriever_random --trainer.logger.save_dir logs/predict_retriever_random
This is the error I get: