microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
MIT License
423 stars 34 forks source link

[Dev] Add support and test case for Ladder Weight only Transformation Matmul Operator #212

Closed LeiWang1999 closed 1 month ago

LeiWang1999 commented 1 month ago

This pull request includes several changes to the bitblas library, focusing on improving the matrix multiplication operations and adding new scheduling capabilities. The most important changes involve updates to propagation handling, scheduler conditions, and test configurations.

Propagation Handling:

Scheduler Conditions:

Scheduler Class:

Typing and Method Signatures:

Test Configurations: