intel / intel-xpu-backend-for-triton

OpenAI Triton backend for Intel® GPUs
MIT License

[XPU][OptRed] Revamp `-tritonintelgpu-optimize-reduction-locality` #2800

Open victor-eds opened 3 days ago

victor-eds commented 3 days ago

The original implementation had two critical issues:

This was fixed as follows:

See implementation for further details.

Closes #2752