Closed AmirZur closed 2 months ago
Introduced tutorial for ReFT + DPO in examples/dpo folder.
examples/dpo
Files:
examples/dpo/dpo_trainer.py
DPOTrainer
trl
examples/dpo/dpo.ipynb
examples/dpo/README.md
Thanks!
Introduced tutorial for ReFT + DPO in
examples/dpo
folder.Files:
examples/dpo/dpo_trainer.py
- adaptingDPOTrainer
fromtrl
library to ReFT models.examples/dpo/dpo.ipynb
- tutorial notebook for ReFT + DPO on TruthfulQA dataset.examples/dpo/README.md
- overview file for tutorial.