microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
MIT License
423 stars 34 forks source link

[Dev][TL] Following updates of Tile Language Backend #226

Closed LeiWang1999 closed 1 month ago

LeiWang1999 commented 1 month ago

This pull request includes several updates to the bitblas library, focusing on the matmul operations and their testing. The most important changes involve making parameters optional in the apply_config function, adding a new scheduler for fine-grained dequantization, and enhancing test coverage for the new scheduler.

Enhancements to apply_config function:

Addition of Fine-Grained Dequantization Scheduler:

Test Coverage Enhancements:

LeiWang1999 commented 1 month ago

TODO Items: