microsoft / BitBLAS

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
MIT License

[DEV][FP8] Improve e4m3 decoding #43

Closed LeiWang1999 closed 1 month ago

LeiWang1999 commented 1 month ago

This pull request refines the type conversions and adjusts the precision used in the test function, with the aim of improving the code's efficiency and accuracy.
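For background on what e4m3 decoding involves, here is a minimal reference sketch in plain Python. It is not the code from this pull request. It assumes the common OCP "E4M3FN" layout (1 sign bit, 4 exponent bits, 3 mantissa bits, exponent bias 7, no infinities, NaN encoded as exponent and mantissa all ones), and the function name `decode_e4m3` is hypothetical.

```python
import math


def decode_e4m3(byte: int) -> float:
    """Decode one FP8 E4M3 byte into a Python float.

    Assumed layout (OCP E4M3FN variant): bit 7 is the sign, bits 6..3 are
    the exponent (bias 7), bits 2..0 are the mantissa. There is no
    infinity; exponent 0b1111 with mantissa 0b111 is NaN.
    """
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a value in the range 0..255")

    sign = -1.0 if byte & 0x80 else 1.0
    exponent = (byte >> 3) & 0xF
    mantissa = byte & 0x7

    # The only NaN encoding in this variant (either sign).
    if exponent == 0xF and mantissa == 0x7:
        return math.nan

    # Subnormal numbers: no implicit leading 1, fixed exponent of 1 - bias.
    if exponent == 0:
        return sign * (mantissa / 8.0) * 2.0 ** -6

    # Normal numbers: implicit leading 1 and a biased exponent.
    return sign * (1.0 + mantissa / 8.0) * 2.0 ** (exponent - 7)
```

Under these assumptions, `decode_e4m3(0x38)` gives `1.0` and `decode_e4m3(0x7E)` gives `448.0`, the largest finite value in the format. A GPU kernel would typically do the same conversion with bit manipulation on packed values rather than floating-point arithmetic per element; the scalar version here is only meant as a correctness reference for tests.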

Here are the key changes:

Type conversion refinement:

Precision adjustment: