LLNL / scr

SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability for MPI codes.
http://computing.llnl.gov/projects/scalable-checkpoint-restart-for-mpi
Other
99 stars 36 forks source link

possible scheduler interface #506

Open gonsie opened 2 years ago

gonsie commented 2 years ago

We should consider: https://exaworks.org/psi-j-python/ It’s a portable submission interface for job schedulers in python. Maybe this will be helpful in SCR.