I had a bunch of simulations hang, and I suspect the cause is the generation of the initial particle configuration, which fails when the volume fraction is too high. `_pack_particles` should have raised an error and killed the processes, but it is only called on the root rank. When it raises there, the non-raising ranks probably deadlock at a later barrier that the root rank never reaches.
The issue is probably also present for HOOMD, but I haven't tested it.
I think this is known behavior in mpi4py:
https://mpi4py.readthedocs.io/en/stable/mpi4py.run.html
We should document this behavior and solution.