stanfordnlp / pyreft

ReFT: Representation Finetuning for Language Models
https://arxiv.org/abs/2404.03592
Apache License 2.0
947 stars 77 forks source link

ReFT + DPO Tutorial #76

Closed AmirZur closed 2 months ago

AmirZur commented 2 months ago

Introduced tutorial for ReFT + DPO in examples/dpo folder.

Files:

frankaging commented 2 months ago

Thanks!