ulfm-devel / ompi

Open MPI main development repository
https://www.open-mpi.org
Other
0 stars 0 forks source link

instantaneous detection of failure from shared-memory siblings #11

Closed abouteiller closed 6 years ago

abouteiller commented 8 years ago

Original report by Aurelien Bouteiller (Bitbucket: abouteiller, GitHub: abouteiller).


In previous ULFM, when orted detected a failure w/ SIGCHILD, it would report immediately to the dead's processes siblings, significantly increasing the detection speed.

Such feature should be reintroduced using pmix

abouteiller commented 6 years ago

Original comment by Aurelien Bouteiller (Bitbucket: abouteiller, GitHub: abouteiller).


This has been resolved with PMIx integration