microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
MIT License
190 stars 21 forks

[Dev] Fix GEMV Dynamic Scheduling with Splitk #52

Closed LeiWang1999 closed 3 weeks ago

LeiWang1999 commented 3 weeks ago

This pull request changes the bitblas Python package, specifically gemv.py, gemv_dequantize.py, general.py, and __init__.py. It updates the version, modifies function conditions to handle additional cases, and adjusts test parameters.
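For readers unfamiliar with the term in the title, split-K means cutting the reduction axis of a matrix-vector product into chunks, computing a partial result per chunk, and summing the partials. The following is a minimal pure-Python sketch of that general idea only. It is not BitBLAS code and does not reflect how this PR schedules the kernel; the names `gemv_reference`, `gemv_splitk`, and `split_k` are hypothetical and chosen for illustration.

```python
def gemv_reference(A, x):
    """Plain GEMV: y[i] = sum over k of A[i][k] * x[k]."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]


def gemv_splitk(A, x, split_k):
    """Illustrative split-K GEMV (hypothetical helper, not the BitBLAS API).

    The reduction axis K is cut into `split_k` chunks. Each chunk yields a
    partial dot product per row, and the partials are accumulated into y.
    """
    K = len(x)
    # Ceil division: the last chunk may be shorter, or empty if split_k > K.
    chunk = (K + split_k - 1) // split_k
    y = [0] * len(A)
    for s in range(split_k):
        lo, hi = s * chunk, min((s + 1) * chunk, K)
        for i, row in enumerate(A):
            # Partial reduction over this chunk of K only.
            y[i] += sum(row[k] * x[k] for k in range(lo, hi))
    return y


A = [[1, 2, 3, 4, 5],
     [6, 7, 8, 9, 10]]
x = [1, 0, -1, 2, 3]
print(gemv_splitk(A, x, split_k=2) == gemv_reference(A, x))
```

On a GPU the chunks would typically be handled by separate thread blocks, which is why the scheduling has to cope with a K that does not divide evenly, as the ceil division above does.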

Version updates:

Function condition modifications:

Function call modifications:

Test parameter modifications: