iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.
http://iree.dev/
Apache License 2.0

Debug configure ci #19239

Closed bjacob closed 4 days ago

bjacob commented 4 days ago

It's purely device code, so it doesn't need HIP's defining feature of generating both host and device code from a single source. It can be plain C code that happens to be compiled for the AMDGPU target.
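To illustrate what "just C code" means here, below is a minimal sketch of a device-only routine written in plain C. The function name `example_mmt_2x2x2_i8` and its tile shape are hypothetical and not taken from this PR. Nothing in it depends on HIP (no `__global__`, no kernel launch syntax, no host-side stubs), so the same source builds with an ordinary host C compiler for testing and could also be handed to clang with an AMDGPU target.

```c
// Hypothetical example, not code from this PR.
// Only freestanding headers are used, so no libc is required on the device.
#include <stdint.h>

// Accumulates a 2x2 tile: acc[m][n] += sum over k of lhs[m][k] * rhs[n][k].
// All three operands are row-major; rhs is stored transposed (N x K).
void example_mmt_2x2x2_i8(const int8_t *lhs, const int8_t *rhs, int32_t *acc) {
  for (int m = 0; m < 2; ++m) {
    for (int n = 0; n < 2; ++n) {
      int32_t sum = acc[m * 2 + n];
      for (int k = 0; k < 2; ++k) {
        // Widen to int32_t before multiplying so the product cannot overflow.
        sum += (int32_t)lhs[m * 2 + k] * (int32_t)rhs[n * 2 + k];
      }
      acc[m * 2 + n] = sum;
    }
  }
}

// One plausible way to target the GPU (an assumption; the flags this PR
// actually uses come from the CMake file mentioned in the description):
//   clang -x c --target=amdgcn-amd-amdhsa -mcpu=gfx942 -O3 -c -emit-llvm file.c
```

Because the routine is ordinary C, it can be checked on the host before it is ever compiled for a GPU, which is one practical benefit of not requiring the HIP toolchain.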

The compiler flags are taken from build_tools/cmake/iree_amdgpu_library.cmake on the users/benvanik/amdgpu branch.

Tested with:

pytest experimental/regression_suite/tests/pregenerated/test_ukernel.py -k gfx942

Notes: