kssteven418 SqueezeLLM-gradients issues - Githubissues

kssteven418 / SqueezeLLM-gradients

Apache License 2.0

12 stars 7 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

CUDA errors running SqueezeLLM-gradients

#15 georgelund opened 4 months ago
1
fix device mismatch bug

#14 kssteven418 closed 4 months ago
0
Faster gradient square accumulation using hooks

#13 SyphonArch closed 4 months ago
0
Fix C4 Loading

#12 sidjha1 closed 5 months ago
0
missing requirement file

#11 kssteven418 closed 5 months ago
0
missing file added

#10 kssteven418 closed 7 months ago
0
basic implementation for llama and mistral

#9 kssteven418 closed 7 months ago
0
Update README.md

#8 kssteven418 closed 7 months ago
0
num_linear_layers not found in Llama

#7 Quang-elec44 opened 7 months ago
0
Faster implementation for gardient square accumulation

#6 kssteven418 opened 7 months ago
0
Size mismatch for model layers when trying to compute gradient for llama-2-70b

#5 tjtanaa opened 8 months ago
0
Update README.md

#4 kssteven418 closed 9 months ago
0
Update README.md

#3 kssteven418 closed 9 months ago
0
Update README.md

#2 kssteven418 closed 9 months ago
0
Add gradient computation

#1 kssteven418 closed 9 months ago
0