microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
MIT License
428 stars 34 forks source link

[Dev] Add some tests and examples #228

Closed LeiWang1999 closed 1 month ago

LeiWang1999 commented 1 month ago

This pull request includes several changes across multiple files, primarily focusing on refactoring configurations, improving dequantization processes, and updating test cases. The most important changes include adding configuration options for weight transformation, modifying the dequantization process, and updating test configurations and assertions.

Configuration and Refactoring:

Dequantization Process:

Testing and Assertions:

Miscellaneous: