Open paulcx opened 8 months ago
Hi @jondurbin Thanks for sharing the bagel model. What kind of hardware specification for DPO trainning? Your DPO training is based on Lora but not full parameters, am I right?
Hi @jondurbin Thanks for sharing the bagel model. What kind of hardware specification for DPO trainning? Your DPO training is based on Lora but not full parameters, am I right?