Open nikhil-tensorwave opened 3 weeks ago
I've never seen such errors.
Can you check with this branch https://github.com/amd/openmm-hip/pull/14 (https://github.com/StreamHPC/openmm-hip/tree/develop_stream)?
Also can your run ctest without -j (ctest --output-on-failure
)? Perhaps something is wrong with concurrent compilation/running.
Btw, what GPUs do you use?
Switching to that branch and adding gfx942 to the list of GPU architectures fixed the issue! Thank you very much for the help. Also, we're running on Mi300Xs
When building OpenMM-HIP and running
make test
I am running into HIP compiler errors. These errors are of the typeI'm also getting
Runtime environment: ROCm 6.1.1 Ubuntu 22.04 Python 3.10 PyTorch 2.4.0
These were the setup steps used:
When rerunning the make tests, a small percentage will pass.
Any help on this would be appreciated.