Open opoplawski opened 2 years ago
ucx-trace.log UCX_LOG_LEVEL=data output with current ucx git head.
@opoplawski seems like the issue is in osc/ucx that is passing memh=NULL
That much is clear, what isn't clear to me is where that originates - ucx, openmpi, or the application code.
i'd start with tracking how it becomes NULL in osc/ucx layer
Can we try rerunning once the PR is in? Should be fixed with open-mpi/ompi#10126
Describe the bug
Steps to Reproduce
This is on fedora rawhide with openmpi 4.1.2-0.1.rc1 and ucx-1.11.2-1 and building mpi4py 3.1.2
Setup and versions
cat /etc/issue
orcat /etc/redhat-release
+uname -a
Additional information (depending on the issue)
valgrind doesn't report any other errors before this crash.