xichen-fy / Fira

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?
Apache License 2.0
71 stars 2 forks source link