kssteven418 / SqueezeLLM-gradients

Apache License 2.0
14 stars 7 forks source link

Faster implementation for gardient square accumulation #6

Open kssteven418 opened 10 months ago

kssteven418 commented 10 months ago

This will be merged after internal testing