mlc-ai / relax

Apache License 2.0
140 stars 70 forks source link

[Dlight] Enhance fallback schedule with DecomposeReduction #266

Closed MasterJH5574 closed 1 year ago

MasterJH5574 commented 1 year ago

The current fallback dlight schedule does not decompose the reduction init blocks, which might lead to some correctness issue (observed on Metal while not CUDA).

Doing DecomposeReduction effectively resolve the issue and meanwhile provide (minor) performance improvement.