ROCm / triton

Development repository for the Triton language and compiler
MIT License
80 stars 22 forks source link

[MFMA][FRONTEND] Add more options for forced mfma layout sizes #538

Open binarman opened 3 months ago

binarman commented 3 months ago

This PR:

MFMA size heuristic now looks like this:

  1. If kernel specific option is set, pick it
  2. If the result tile shape is larger than 32x32, pick mfma32
  3. If the tile shape is smaller than 32x32 but larger than 16x16, pick mfma16
  4. if the tile shape is smaller than 4x64 or 64x4, pick mfma4x4
  5. Otherwise, pick mfma4x64 or mfma64x4, depending on what tile fits into matrices