Open glwagner opened 3 weeks ago
Why don't we test the distributed NonhydrostaticModel here?
or are there tests elsewhere?
The test architectures are specified here:
This was hard to find at first
Are the distributed GPU tests actually running?
I see this:
And then subsequently it looks like the architecture is Distributed{CPU}
.
We need a better way to specify the test architectures?
@simone-silvestri
Damn, it looks like the tests on the GPU are not working because CUDA is not loaded properly. I am trying to address this in #3880. A segmentation fault probably means the MPI is not CUDA-aware. Typically, the MPI that is shipped with MPI_jll is not cuda-aware. A good way to check is
julia> using MPI
julia> MPI.has_cuda()
true
Thank @simone-silvestri, it turns out that I wasn't using CUDA-aware MPI.
Since we don't have GPU tests right now I will also check to make sure that this runs with a proper CUDA-aware MPI.
Not sure how this is possible, but the following code throws a segfault:
I'm running with
(I found this error originally when trying to interpolate a field, but it seems it boils down to a halo filling issue)
This is the error I get:
I'll test CPU then try to see if this situation is tested.