Release of Trained Models

jiaweizzhao / GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Apache License 2.0

1.43k stars 148 forks source link

Open JLake310 opened 7 months ago

JLake310 commented 7 months ago

Hi, thanks very much for sharing your impressive work!

Would it be possible to release the trained model (e.g., using the script below)? It would greatly facilitate reproducibility efforts. Thank you for considering this request. https://github.com/jiaweizzhao/GaLore/blob/1b36c33782bdd74a4d6a4f51bc626ef67f51011f/scripts/benchmark_c4/llama_7b.sh