foundation-model-stack / fms-acceleration

🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
Apache License 2.0
0 stars 4 forks source link

Initial Addition of FusedOps and Kernels Plugin With Model Patcher #25

Closed fabianlim closed 1 month ago

fabianlim commented 1 month ago

This is an initial addition of the FusedOps and Kernels Plugin

TODO:

Initial Tests on L40

Model Test Tokens /s
TheBloke/Mistral-7B-v0.1-GPTQ No fused-ops/ kernels 2492
TheBloke/Mistral-7B-v0.1-GPTQ With fused-ops/ kernels 3001