Closed RavidLightricks closed 1 year ago
Use the export_path as the model_name_or_path
without a checkpoint_dir
after the model exporting, and specify a different output_dir
@hiyouga still, i'm getting empty predictions, even when I'm using my own prediction script which is working on SFT models. I believe something in my export is not right.
try exporting the SFT and DPO weights separately
I think I might be missing something. In the export step, do I still need to supply the reference model? Maybe you guys can add it to the README, I also noticed that the argument dpo_ref_model was added.
ref_model is only used for computing metrics at evaluation. Here is the workflow we recommend:
stage=sft
export_dir
as a new base model to train a DPO model with stage=dpo
cli_demo
to check if the model can generate sentences by providing the DPO weights as the checkpoint_dir
stage=sft
and do_predict
Since DPO workflow doesn't support do_predict, I'm trying to export the model and then run do_predict with stf workflow. But the predictions I'm getting are empty strings.
What am I doing wrong?