Closed Aariq closed 1 year ago
Ok, seems to be working consistently now. Running two remote commands in a row seemed to be causing problems, but a 1-second pause between them seems to fix it. 🤷‍♂️
Now that I've been using this for a while, I've noticed that it still sometimes fails to copy over all the run/ files despite rsync reporting "success". This definitely works better than it did previously, but I can also continue to try to figure this out.
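Since the exit status alone can't be trusted here, one way to catch the problem is to compare the two directory trees after the copy. The following is only a sketch: `missing_files` is a hypothetical helper (not part of PEcAn or rsync), it compares two local trees, and it assumes file names contain no newlines. For a remote destination the existence check would have to run on the remote side, which is not shown.

```shell
# missing_files SRC DST
# Print the relative path of every regular file found under SRC that has
# no counterpart under DST. Prints nothing when DST is complete.
# Hypothetical helper for illustration only.
missing_files() {
  src=$1
  dst=$2
  # List files relative to SRC, then keep those that do not exist under DST.
  (cd "$src" && find . -type f) | while IFS= read -r f; do
    [ -e "$dst/$f" ] || printf '%s\n' "${f#./}"
  done
}
```

An empty result means every source file was found at the destination; any output is the list of files that did not make it across.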
Marked as draft again because I'm getting close (hopefully) to really fixing this.
Ok, the remaining bug has nothing to do with PEcAn or with R for that matter. I've been able to reproduce it with just rsync. It also seems to be specific to our server, as I can't reproduce it from my laptop. Going to open this back up for review. One possible safeguard is to wrap the remote.copy.to commands in PEcAn.utils::retry.func(), but I'll wait to get feedback before doing this.
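To illustrate the retry idea outside of R, here is a minimal shell sketch. It is not PEcAn.utils::retry.func() itself, just the same pattern applied to an arbitrary command; the `retry` function and its argument order are made up for this example.

```shell
# retry MAX_TRIES DELAY_SECONDS COMMAND [ARGS...]
# Run COMMAND until it exits 0 or MAX_TRIES attempts have been made,
# sleeping DELAY_SECONDS between attempts.
# Hypothetical helper for illustration only.
retry() {
  max_tries=$1
  delay=$2
  shift 2
  attempt=1
  while :; do
    # Stop as soon as the command succeeds.
    "$@" && return 0
    if [ "$attempt" -ge "$max_tries" ]; then
      echo "failed after $attempt attempts: $*" >&2
      return 1
    fi
    attempt=$((attempt + 1))
    sleep "$delay"
  done
}
```

One caveat: a retry only triggers when the wrapped command reports failure. Because rsync reports "success" in the failing case, the retried command would need to include its own completeness check for this safeguard to help.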
@robkooper could you look at the changes you requested so we can figure out whether this is ready to pull?
Addresses #3019 with changes to start_model_runs()
Description
- Moved remote.copy.to outside of the for loop to copy over all /out and /run directories at once.
- Moved remote.copy.from outside of the for loop to copy all outputs back to local at once.
- remote.copy.to() and remote.copy.from()
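The restructuring can be pictured with a small shell sketch. This is not the code in start_model_runs(): `copy_dir` is a hypothetical local stand-in for the remote copy (it uses cp and counts its calls), and `sync_runs` and its arguments are made up for illustration.

```shell
# Hypothetical stand-in for a remote copy. It copies the contents of
# directory $1 into directory $2 locally and counts how often it is called.
copy_calls=0
copy_dir() {
  copy_calls=$((copy_calls + 1))
  mkdir -p "$2"
  cp -R "$1/." "$2/"
}

# sync_runs RUN_ROOT REMOTE_ROOT
# Per-run work stays inside the loop; the copy happens once, after it.
sync_runs() {
  run_root=$1
  remote_root=$2
  for run_dir in "$run_root"/*/; do
    # Placeholder for per-run preparation; no copying happens here.
    : "preparing $run_dir"
  done
  # One copy of the whole tree instead of one copy per run.
  copy_dir "$run_root" "$remote_root"
}
```

With N runs this makes one transfer instead of N, so far fewer remote commands are issued back to back.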
There aren't any tests for start_model_runs() and I'm not sure how to write them, but I did install this PR and test it with my own setup.
Motivation and Context
Review Time Estimate
Types of changes
Checklist: