stanfordnlp / pyreft

ReFT: Representation Finetuning for Language Models
https://arxiv.org/abs/2404.03592
Apache License 2.0
947 stars 77 forks

[P0] Revert back to ortho init as unstable training #103

Closed frankaging closed 4 weeks ago