nod-ai / sharktank

SHARK Inference Modeling and Serving
Apache License 2.0
7 stars 9 forks source link

[quant] Dedynamize the integer kernels. #85

Closed stellaraccident closed 2 weeks ago

stellaraccident commented 2 weeks ago

This produces precise specializations based on the level of dynamism in use vs punning everything to fully dynamic. As a byproduct, this fixes a bug where some conv/sum kernels did not have a function signature that was sufficiently specialized, resulting in aliasing and bad kernel selection.