Open victor-eds opened 1 week ago
Our backend introduces a new way to perform layout conversions via shuffles. Modify the --intel-allocate-shared-memory pass to not introduce barriers before shuffles not using SLM at all and thus not needing barriers.
--intel-allocate-shared-memory
Note this may be fixed by #2556.
Depends on #2611. This will enable us to optimize SLM allocations and not inserting additional barriers.
Our backend introduces a new way to perform layout conversions via shuffles. Modify the
--intel-allocate-shared-memory
pass to not introduce barriers before shuffles not using SLM at all and thus not needing barriers.Note this may be fixed by #2556.