intel / intel-xpu-backend-for-triton

OpenAI Triton backend for Intel® GPUs
MIT License
143 stars 44 forks source link

[XPU][OptEW] Allow multiple warps in non-sliced dimension #2670

Closed victor-eds closed 1 week ago

victor-eds commented 1 week ago

Allow multiple warps in non-sliced dimension as long as there are n*sub_group_size contiguous elements per warp in the non-sliced dimension.

victor-eds commented 1 week ago

Second step for https://github.com/intel/intel-xpu-backend-for-triton/issues/2562. Need to evaluate whether this is enough work on the pass or we need to extend it.