upperwal / EntangledMPI

Fault Tolerance framework for High Performance Computing [Supports ULFM, replication and checkpointing]
MIT License
2 stars 1 forks source link

Fault Injection Mechanism #10

Closed upperwal closed 6 years ago

upperwal commented 6 years ago

A good fault injector which can kill nodes with a probabilistic model across time and ranks within different nodes.

upperwal commented 6 years ago

Added Fault Injector in GO. Ref. to 8773e32c6348154bb23667e2447468a676de1616