mabarnes / moment_kinetics

MPI issues #148

Closed johnomotani closed 8 months ago

johnomotani commented 8 months ago

We get MPI errors if we run in parallel with the latest version of all the dependencies (as of 19/11/2023).

I think the problem is a slightly complicated mix-up that will probably be fixed in the Julia packages relatively soon:

Possible workarounds:

  1. Wait for the Julia packages to be fixed, then the problem should go away.
  2. Pin HDF5_jll to a slightly older, working version (i.e. version 1.12.x) at least until the Julia packages are fixed.
  3. Get rid of the NetCDF file I/O.
  4. Tell NCDatasets to use a system-provided libnetcdf.so, so it doesn't link to the HDF5_jll.jl version of HDF5. On systems where we have to compile HDF5 for ourselves, this would be annoying as we would have to compile NetCDF as well, and link it to the local version of HDF5.

I think option 2 is the best and easiest solution, while we wait for a fix to https://github.com/JuliaPackaging/Yggdrasil/issues/6893. I think it's possible to tell Julia to pin a package to a certain version, rather than everyone having to do it by hand (and even if we did it 'by hand' the CI jobs would have to do the same thing, which would probably be more work than pinning a package). I'll try to make a PR...
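For anyone who needs a stopgap before such a PR lands, here is a minimal sketch of doing option 2 by hand with Julia's built-in `Pkg` API. It assumes the active project is the one used to run moment_kinetics, and that a 1.12.x release of HDF5_jll is still compatible with the other packages in that environment; if it is not, the resolver will report a conflict instead of installing it.

```julia
using Pkg

# Assumption: the environment used to run the code is already activated,
# e.g. by starting Julia with `julia --project`.

# Install the newest HDF5_jll release in the 1.12 series.
Pkg.add(name="HDF5_jll", version="1.12")

# Pin it so that later `Pkg.update()` calls do not move it to a newer version.
Pkg.pin("HDF5_jll")

# Once the upstream fix is released, the pin can be removed again with:
# Pkg.free("HDF5_jll")
```

A pin made this way is stored in the environment's `Manifest.toml`, so it only affects the machine where it is run. A project-wide restriction would more likely be expressed as a `[compat]` bound on HDF5_jll in `Project.toml`, which is probably closer to what the PR mentioned above would do.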