jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Apache License 2.0
Seems not compatible with DeepSpeed (#12, Closed)
geniusalert closed 3 months ago
geniusalert commented 3 months ago
Seems not compatible with DeepSpeed
jiaweizzhao commented 3 months ago
Refer to #2