Closed alvarobartt closed 11 months ago
The run in Weights and Biases still seems to crash at the end even though everything's properly uploaded to the Hub, but I think it's related to the hub_strategy: every_save
in combination with save_steps: 500
, starting an async process that pushes the model into the Hub, and then DeepSpeed / accelerate throwing a timeout :/
Description
This PR adds the adapted
run_sft.py
fromhuggingface/alignment-handbook
to work with our dataset instead of HuggingFace H4's one, similarly to the one already adapted for DPO.This PR also updates the configuration files needed, and the SFT fine-tuning has already been triggered, in this case we keep the
chosen_response
as the response to use for SFT, while the rest is discarded.More details coming soon!