foundation-model-stack / fms-acceleration

🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
Apache License 2.0
6 stars 12 forks source link

Update Benchmarks and Documentation for GraniteCausalLM #86

Closed fabianlim closed 2 months ago

fabianlim commented 2 months ago

In this PR we update the benchmarks for GraniteCausalLM

Note this PR requires the following dependency updates

Known issues with quant peft

Performance

Overall impressive improvements with kernels.

FULL FT image

PEFT image

Quantized Peft (BNB) image

wynterl commented 2 months ago

awesome, great results @fabianlim

raghukiran1224 commented 2 months ago

Indeed, awesome results @fabianlim !

fabianlim commented 2 months ago

@wynterl @raghukiran1224 the loss for BNB + fused ops looks problematic. ~Needs more debugging~, Ok i found that its because Granite has a bias in the Linear, but the FOAK kernels do not support bias. This just requires some minor (but tedious) modifications