Closed ickc closed 1 year ago
Copied from email thread:
From @rwf14f (Robert):
use:
should_transfer_files = YES
when_to_transfer_output = on_exit_or_evict
Setting should_transfer_files to IF_NEEDED or NO only works if there's a shared file system between all nodes if I remember correctly. And we currently don't have this.
Copied from email thread:
I got the same error after setting that to YES.
Error from [slot1_2@wn5916340.in.tier2.hep.manchester.ac.uk](mailto:slot1_2@wn5916340.in.tier2.hep.manchester.ac.uk): Failed to transfer files
This example uses /bin/sleep so I think transferring is not necessary. But the problem is that even if I set it to NO, it would idles indefinitely.
Copied from email thread:
From Robert:
looks like this might be a configuration issue, some of the worker nodes work fine while others cause this problem. I'll need to further look into this.
Copied from email thread:
From Robert:
the file transfer errors should be gone now.
Copied from email thread:
I’m following https://htcondor.readthedocs.io/en/latest/users-manual/parallel-applications.html#simplest-example to test submitting jobs to the parallel universe.
Upon submitting that, the error I got is
And then the job would seems to be stuck in the queue and idle forever.
If I submitted a slightly modified example of
Then the job would not fail, but seems to be stuck in the queue and idle forever too.
How to solve this?