microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
MIT License
190 stars 21 forks source link

[Dev] Improve General Matmul With Splitk #50

Closed LeiWang1999 closed 3 weeks ago

LeiWang1999 commented 3 weeks ago

This pull request includes changes to the bitblas library and its associated tests. The most significant changes include enabling debug output in QuickStart.md, modifying the forward method in python/bitblas/module/__init__.py and python/bitblas/ops/general_matmul_splitk.py, and adjusting the testing scripts testing/python/operators/test_general_matmul_fp8.py and testing/python/operators/test_general_matmul_splitk_ops.py.

Debug output:

Codebase modifications:

Testing script adjustments: