After using Axolotl to run SFT on my mistral7b model, I tried to align it using DPO.
At some point during the DPOTrainer initialization, the script freezes and then exits once the timeout is reached.
When I run the same script on the base model (https://huggingface.co/TokenBender/pic_7B_mistral_Full_v0.2), it works fine.
I am attaching a screenshot of the point where it freezes.