upperwal/EntangledMPI
Fault Tolerance framework for High Performance Computing [Supports ULFM, replication and checkpointing]
MIT License
2 stars, 1 fork
issues
#40 Integrate Docker to support Process Replication and easy migration (upperwal, opened 6 years ago, 0 comments)
#39 Major Refactor (upperwal, closed 6 years ago, 0 comments)
#38 Uncoordinated Checkpoints (bethven, closed 6 years ago, 3 comments)
#37 IB not configured properly on node1 (upperwal, opened 6 years ago, 1 comment)
#36 Fortran's integer is 4 bytes but MPI_Request is 8 bytes (upperwal, opened 6 years ago, 1 comment)
#35 Process Manager not updating the update bit properly (upperwal, opened 6 years ago, 0 comments)
#34 Improving MPI_ANY_SOURCE algo (upperwal, opened 6 years ago, 0 comments)
#33 MPI_Recv does not support MPI_ANY_SOURCE (upperwal, opened 6 years ago, 0 comments)
#32 NAS Parallel Benchmark's Verification is mostly UNSUCCESSFUL (upperwal, closed 6 years ago, 2 comments)
#31 MPI_Wait hangs forever when using comm dup in MPI call (upperwal, opened 6 years ago, 0 comments)
#30 MPI_Comm_dup is collective call (upperwal, opened 6 years ago, 1 comment)
#29 Deadlock because of comm correction in different function calls (upperwal, closed 6 years ago, 2 comments)
#28 Comm correction + manager cmd arguments (upperwal, closed 6 years ago, 0 comments)
#27 Rank retains previous value across checkpoints (upperwal, opened 6 years ago, 0 comments)
#26 Now transferring and saving __pass_****_cont_add also (upperwal, closed 6 years ago, 0 comments)
#25 correct rep_free + abort on diff stack start address (upperwal, closed 6 years ago, 0 comments)
#24 comm corruption during comm_update (upperwal, closed 6 years ago, 2 comments)
#23 Added Process Manager (upperwal, closed 6 years ago, 0 comments)
#22 Different stack starting address (upperwal, opened 6 years ago, 1 comment)
#21 Removed sleep() (upperwal, closed 6 years ago, 0 comments)
#20 Fault injector + bugs removal (upperwal, closed 6 years ago, 0 comments)
#19 Erratic behaviour of PMPIX_Comm_agree (upperwal, closed 6 years ago, 1 comment)
#18 make MPI_Send and MPI_Recv Fault Tolerant (upperwal, opened 6 years ago, 0 comments)
#17 Process Manager (upperwal, closed 6 years ago, 2 comments)
#16 Fault injector (upperwal, closed 6 years ago, 0 comments)
#15 rep_clear_discontiguous throwing seg fault (upperwal, opened 6 years ago, 2 comments)
#14 Second PMPIX_Comm_agree not required (upperwal, closed 6 years ago, 2 comments)
#13 Merge pull request #12 from upperwal/fault_injector (upperwal, closed 6 years ago, 0 comments)
#12 Added replication map info in readme (upperwal, closed 6 years ago, 0 comments)
#11 Updated README and travis (upperwal, closed 6 years ago, 0 comments)
#10 Fault Injection Mechanism (upperwal, closed 6 years ago, 1 comment)
#9 Add Travis support for Go compilation (upperwal, opened 6 years ago, 0 comments)
#8 Added Travis (upperwal, closed 6 years ago, 0 comments)
#7 Create LICENSE (upperwal, closed 6 years ago, 0 comments)
#6 Create a global job communicator (upperwal, closed 6 years ago, 2 comments)
#5 Pointer passing in functions will fail replication and checkpoint (upperwal, closed 6 years ago, 3 comments)
#4 Replication not working with MPI_Finalize() (upperwal, closed 6 years ago, 1 comment)
#3 Support Request on the fly for async communication (upperwal, opened 6 years ago, 0 comments)
#2 Different behaviour of "random" function (libc) in compute and replica (upperwal, opened 6 years ago, 0 comments)
#1 ULFM and Fault Injection (upperwal, closed 6 years ago, 0 comments)