openvinotoolkit / oneDNN

oneAPI Deep Neural Network Library (oneDNN)
https://01.org/dnnl
Apache License 2.0
16 stars 42 forks source link

[FORK][FEATURE] IP weights compression: mxfp4 (wei=f4e2m1, scales=f8e8m0) #258

Closed dmitry-gorokhov closed 1 month ago