Allow manual specification of kernels in fp8 rowwise

pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

Other

1.17k stars 475 forks source link

Allow manual specification of kernels in fp8 rowwise #2951

Open jwfromm opened 1 month ago

jwfromm commented 1 month ago

Summary: For performanc evaluation purposes, it may be convenient to specify a specific kernel be used for a given workload. This diff makes it easy to do so from python by adding the new kernel_name argument to fp8_rowwise.

When kernel_name is provided, we use a lookup table to grab the specific kernel. When its not provided, we use our usual dispatch and heuristics.

Differential Revision: D60972273

netlify[bot] commented 1 month ago

Deploy Preview for pytorch-fbgemm-docs ready!

Name	Link
Latest commit	ae2305dbabddd4f5d60d3659e1d48ffc6f009b93
Latest deploy log	https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/66b503e3a175200008bb6ee6
Deploy Preview	https://deploy-preview-2951--pytorch-fbgemm-docs.netlify.app
Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

facebook-github-bot commented 1 month ago

This pull request was exported from Phabricator. Differential Revision: D60972273

pytorch / FBGEMM

Allow manual specification of kernels in fp8 rowwise #2951

✅ Deploy Preview for pytorch-fbgemm-docs ready!

Deploy Preview for pytorch-fbgemm-docs ready!