Open jprotze opened 4 months ago
@llvm/issue-subscribers-openmp
Author: Joachim (jprotze)
I can't reproduce with your test and compiler invocation unfortunately. I don't exactly know what's going on in your example. The non-LTO build of NVPTX targets should use -mlink-builtin-bitcode
to resolve the OpenMP runtime per-TU. This is silently broken for anything that shares state between files, but it should result in having only a single .cubin
, so I'm struggling to figure out where the other one is coming from.
Building an OpenMP hello world fails to link, when Optimization is turned on:
Building like:
This fails with
I tried with a quite recent build from main (f7c2e5fa05). I tried with a release 17 build (6009708b). Both versions show this issue. I tried on different of our systems with different GPUs equipped, with different versions of CUDA (11.6/11.8/12.1.1). The result is consistent with clang 17 or newer. I tried a clang/16.0.6 build, which succeeds to build with any optimization level.
@jhuber6 did you see something like this before?