Closed llauraa23 closed 8 months ago
Implements SFT training that also supports the d2l application. Adds inference evaluation of the fine-tuned model. Adds support for DPO training (ongoing).