pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Other
1.18k stars 486 forks source link

Add Cutlass Blockwise Kernel to Quantize Benchmark #2800

Closed jwfromm closed 3 months ago

jwfromm commented 3 months ago

Summary: This diff adds the new cutlass blockwise kernel added in D57965065 to quantize_bench.py. I also set a default block size to make the API conformant with the triton quantize op that is often paired with it.

Reviewed By: choudharydhruv

Differential Revision: D59249763

facebook-github-bot commented 3 months ago

This pull request was exported from Phabricator. Differential Revision: D59249763

netlify[bot] commented 3 months ago

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
Latest commit 95ef59be0f57a44b179e0908551db7ba69cf9ccd
Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66833c4708e15400085ef9ed
Deploy Preview https://deploy-preview-2800--pytorch-fbgemm-docs.netlify.app
Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot commented 3 months ago

This pull request has been merged in pytorch/FBGEMM@ece4c24ea77dd5ab34a8c48962f0475646d2b3be.