jiaweizzhao / GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Apache License 2.0
1.24k stars 131 forks source link

Seems not compatible with DeepSpeed #12

Closed geniusalert closed 3 months ago

geniusalert commented 3 months ago

Seems not compatible with DeepSpeed

jiaweizzhao commented 3 months ago

Refer to #2