Closed tony-johnson closed 8 years ago
A few issues came up:
All that being said, I was able to submit jobs, but they immediately failed, probably because no wall clock time was specified although I'm not sure on that.
@brianv0 now has the workflow engine running at NERSC, and we have been able to run a small sample of TwinklesDM jobs using it.
The current implementation does not use NEWT, but instead requires that the daemon is run directly on a login node at NERSC. This seems to work fine, although currently it has to be manually started.
One remaining bug is that rollback does not currently work, but Brian will hopefully get that fixed today. Once that is done the workflow is in principle ready to be used for running Run2 at NERSC.
Rollback is fixed. Closing this issue since basic functionality is now complete. Will open other issues for any additional work.
Hooray! Nice work, you two. So cool to have gained a supercomputing center :-)
On Thu, Jun 9, 2016 at 9:00 AM, Tony Johnson notifications@github.com wrote:
Closed #85 https://github.com/DarkEnergyScienceCollaboration/Twinkles/issues/85.
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/DarkEnergyScienceCollaboration/Twinkles/issues/85#event-687450152, or mute the thread https://github.com/notifications/unsubscribe/AArY9_ySL2yqc_G-W5OWzDnDcEgn5UMpks5qKDiQgaJpZM4HBUDp .
Needs to be upgraded to new batch system (SLURM) at NERSC