Open tylerjereddy opened 1 week ago
You can ignore the warning for the legion_analysis.cc
translation unit. It is a bug with the -Wstringop-overflow
static analysis which is present in many compilers. You can read more about it here.
The real problem is this:
/lustre/vescratch1/treddy/custom_nvidia/legate/src/cpp/legate/task/variant_helper.h:133:16: error: unable to deduce ‘const auto’ from ‘task_wrapper_<std::invoke_result_t<ncclUniqueId (* const)(const Legion::Task*, const std::vector<Legion::PhysicalRegion, std::allocator<Legion::PhysicalRegion> >&, Legion::Internal::TaskContext*, Legion::Runtime*), const Legion::Task*, const std::vector<Legion::PhysicalRegion, std::allocator<Legion::PhysicalRegion> >&, Legion::Internal::TaskContext*, Legion::Runtime*>, variant_impl, variant_kind>’
constexpr auto entry = T::BASE::template task_wrapper_<RET, variant_impl, variant_kind>;
@tylerjereddy could you please try replacing constexpr auto entry
with constexpr Processor::TaskFuncPtr entry
?
Will do, Venado is down for another day or two I think (this time for a dedicated activity time/reservation I think).
@tylerjereddy Does the compiler provide any notes after the error?
A few thousand lines of C++ spam follow the error IIRC (sorry C++ devs..), but I can share the full log once Venado comes back up if you want.
On the LANL Venado machine, Linux ARM/Grace-Hopper architecture, whether using clang 18 (
Cray clang version 18.0.0
) or gcc-13 (13.2.1
) compiler toolchain (both withnvcc
from CUDA 12.5), the same compilation error arises for a recently-providedlegate
release (we only received a tarball--and the only version info I can find isCMakeLists.txt:set(legate_version 24.09.00)
, but this may be a dev version of that and not a tagged release yet). If you direct me to the appropriate location to grep out an embeddedgit
hash I'll go ahead and do that for you, but I don't have agit bundle
, just a preview release tarball as far as I can tell.Here are the steps I follow on Venado:
Set up of environment and compilation commands
```bash cd /lustre/vescratch1/treddy/custom_nvidia/legate rm -rf arch-linux-cuda-release eval "$(/lustre/vescratch1/treddy/tyler_conda/conda_scratch/bin/conda shell.bash hook)" conda activate legate_custom set +o errexit set +e module load PrgEnv-gnu/8.5.0 export CC=gcc-13 export CXX=g++-13 export CPATH=/opt/cray/libfabric/1.20.1/include:$CPATH export LIBRARY_PATH=/opt/cray/libfabric/1.20.1/lib64:$LIBRARY_PATH export LD_LIBRARY_PATH=/opt/cray/libfabric/1.20.1/lib64:$LD_LIBRARY_PATH module load cudatoolkit/24.7_12.5 module load cray-hdf5-parallel/1.14.3.1 export LD_LIBRARY_PATH=/opt/cray/pe/mpich/8.1.30/ofi/crayclang/17.0/lib:$LD_LIBRARY_PATH export LIBRARY_PATH=/opt/cray/pe/mpich/8.1.30/ofi/crayclang/17.0/lib:$LIBRARY_PATH export PATH=$PATH:/opt/cray/pe/cce/18.0.0/bin export PATH=/opt/cray/libfabric/1.20.1/bin:$PATH ./configure --with-cuda --with-hdf5 --with-gasnet export LEGATE_ARCH='arch-linux-cuda-release' export LEGATE_DIR='/lustre/vescratch1/treddy/custom_nvidia/legate' make -j 64 ```And here is the compilation failure (snipped at the end because the C++ compilation spam is after the error is a bit much):