jondurbin / bagel

A bagel, with everything.
312 stars 31 forks source link

Hardware specification #9

Open paulcx opened 8 months ago

paulcx commented 8 months ago

Hi @jondurbin Thanks for sharing the bagel model. What kind of hardware specification for DPO trainning? Your DPO training is based on Lora but not full parameters, am I right?