SCR caches checkpoint data in storage on the compute nodes of a Linux cluster to provide a fast, scalable checkpoint / restart capability for MPI codes.
We should consider: https://exaworks.org/psi-j-python/
It’s a portable submission interface for job schedulers in python. Maybe this will be helpful in SCR.
We should consider: https://exaworks.org/psi-j-python/ It’s a portable submission interface for job schedulers in python. Maybe this will be helpful in SCR.