Add Cutlass Blockwise Kernel to Quantize Benchmark

pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

Other

1.18k stars 486 forks source link

Add Cutlass Blockwise Kernel to Quantize Benchmark #2800

Closed jwfromm closed 3 months ago

jwfromm commented 3 months ago

Summary: This diff adds the new cutlass blockwise kernel added in D57965065 to quantize_bench.py. I also set a default block size to make the API conformant with the triton quantize op that is often paired with it.

Reviewed By: choudharydhruv

Differential Revision: D59249763

facebook-github-bot commented 3 months ago

This pull request was exported from Phabricator. Differential Revision: D59249763

netlify[bot] commented 3 months ago

Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
Latest commit	95ef59be0f57a44b179e0908551db7ba69cf9ccd
Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66833c4708e15400085ef9ed
Deploy Preview	https://deploy-preview-2800--pytorch-fbgemm-docs.netlify.app
Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot commented 3 months ago

This pull request has been merged in pytorch/FBGEMM@ece4c24ea77dd5ab34a8c48962f0475646d2b3be.

pytorch / FBGEMM

Add Cutlass Blockwise Kernel to Quantize Benchmark #2800

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Deploy Preview for pytorch-fbgemm-docs ready!