issues
search
zyushun
/
Adam-mini
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
198
stars
8
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
使用Adam-mini,全参量微调7B模型需要多少显存?
#13
NTDXYG
opened
22 hours ago
4
Publish to pypi
#12
winglian
opened
1 day ago
1
How to Use AdamMini Optimizer with Weight-Tying on Models like qwen0.5b?
#11
relic-yuexi
closed
2 days ago
15
Add a setup.py and some pip install related changes
#10
Mrw33554432
opened
4 days ago
0
Fix code that causes UnboundLocalError
#9
Mrw33554432
closed
5 days ago
2
Memory saving only in checkpoint size, not during training
#8
aditya2331
opened
1 week ago
7
works well on smaller models, updates for torchtitan and 8B size
#7
lessw2020
opened
1 week ago
23
chore: update data_utils.py
#6
eltociear
closed
1 week ago
0
Adam mini can't save when using with FSDP in Huggingface Trainer
#5
hahuyhoang411
opened
2 weeks ago
18
Is there any plant to implement Quantized Adam-mini?
#4
Kyeongpil
opened
2 weeks ago
4
Adam mini can't offload to CPU
#3
hahuyhoang411
closed
2 weeks ago
2
Adam mini isn't compatible with HuggingFace
#2
hahuyhoang411
closed
2 weeks ago
2
Replaced Hardcoded CUDA Calls
#1
hunterqueb
closed
2 weeks ago
1