upperwal / EntangledMPI

Fault Tolerance framework for High Performance Computing [Supports ULFM, replication and checkpointing]
MIT License

Fault injector #16

Closed: upperwal closed this 6 years ago

upperwal commented 6 years ago

Added fault injector support with several random distributions. The fault injection module reads the replication map generated by the process manager and the network map generated by the MPI application, and injects faults according to the user's configuration.
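As a rough illustration of the idea described above, here is a minimal sketch of how fault times could be drawn from a configurable random distribution and assigned to ranks taken from a replication map. It is not the code in this PR. The map layout, the function names (`next_fault_delay`, `pick_victim`, `plan_faults`) and the parameter names are all hypothetical, and the sketch only plans faults rather than killing any process.

```python
import random

# Hypothetical replication map: logical job id -> MPI ranks holding a replica.
# The real map format written by the process manager is not shown in this PR.
REPLICATION_MAP = {0: [0, 1], 1: [2, 3], 2: [4]}


def next_fault_delay(rng, distribution, **params):
    """Draw the delay in seconds until the next injected fault."""
    if distribution == "exponential":
        # expovariate takes the rate, i.e. 1 / mean time between faults.
        return rng.expovariate(1.0 / params["mean"])
    if distribution == "weibull":
        # weibullvariate(alpha, beta): alpha is the scale, beta the shape.
        return rng.weibullvariate(params["scale"], params["shape"])
    if distribution == "uniform":
        return rng.uniform(params["low"], params["high"])
    raise ValueError(f"unknown distribution: {distribution}")


def pick_victim(rng, replication_map, alive):
    """Pick a random rank that is still alive, or None if none are left."""
    candidates = sorted(
        rank
        for ranks in replication_map.values()
        for rank in ranks
        if rank in alive
    )
    return rng.choice(candidates) if candidates else None


def plan_faults(replication_map, distribution, count, seed=0, **params):
    """Return a list of (time, rank) pairs describing the planned faults."""
    rng = random.Random(seed)  # seeded so a fault campaign can be replayed
    alive = {rank for ranks in replication_map.values() for rank in ranks}
    now, schedule = 0.0, []
    for _ in range(count):
        victim = pick_victim(rng, replication_map, alive)
        if victim is None:
            break  # every rank has already been scheduled to fail
        now += next_fault_delay(rng, distribution, **params)
        alive.discard(victim)
        schedule.append((now, victim))
    return schedule
```

A call such as `plan_faults(REPLICATION_MAP, "weibull", 3, seed=42, scale=30.0, shape=0.7)` yields up to three `(time, rank)` pairs. A real injector would additionally need something like the network map to locate each rank's host and process before delivering the fault; that step is left out here because its format is not described in this PR.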