iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.
http://iree.dev/
Apache License 2.0
2.47k stars 548 forks source link

Run w7900 tests in serial instead of parallel. #17686

Closed ScottTodd closed 1 week ago

ScottTodd commented 1 week ago

Follow-up to https://github.com/iree-org/iree/pull/17675, attempting a fix-forward for failed builds: https://github.com/iree-org/iree/actions/runs/9551239129/job/26327378915#step:7:280

These runners are unstable when running multiple GPU processes at once.

ci-exactly: build_all, test_amd_w7900

saienduri commented 1 week ago

Looks like this is good to land. Hopefully helps with the crashes 🙏