pytorch / torchtune

PyTorch native finetuning library
https://pytorch.org/torchtune/main/
BSD 3-Clause "New" or "Revised" License
4.37k stars 445 forks

Apply gradient accumulation fix to DPO/PPO recipes #2037

Open SalmanMohammadi opened 5 days ago