simonsobs-uk / data-centre

This tracks the issues in the baseline design of the SO:UK Data Centre at Blackett
https://souk-data-centre.readthedocs.io
BSD 3-Clause "New" or "Revised" License

interactive node: `Failed to read address of starter for this job` #46

Closed: ickc closed this issue 5 months ago

ickc commented 5 months ago

When requesting an interactive node, the following error occurs on some of the assigned hosts. Minimal working example (MWE):

$ cat example.ini
RequestMemory = 32999
RequestCpus = 16
use_x509userproxy = True
queue
$ condor_submit -i example.ini
Submitting job(s).
1 job(s) submitted to cluster 2231.
Failed to read address of starter for this job
$ condor_q 2231 -long
RemoteHost = "slot1_1@wn3805331.tier2.hep.manchester.ac.uk"
...

rwf14f commented 5 months ago

That's one of the machines I'm using to test the upgrade to Rocky 9. I've changed the config, so your jobs won't try to run on those machines now.
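
For anyone who hits the same error before an admin-side config change like this takes effect, one possible user-side workaround is to exclude the affected host in the submit file. This is a sketch only: it was not suggested or tested in this thread, and it assumes the site does not override user-supplied `Requirements`. The host name is the one reported by `condor_q` above; `Machine` is the standard machine ClassAd attribute holding the worker's host name.

```
RequestMemory = 32999
RequestCpus = 16
use_x509userproxy = True
# Assumption: skip the worker node that produced the starter error
Requirements = (Machine != "wn3805331.tier2.hep.manchester.ac.uk")
queue
```

Submitting it with `condor_submit -i example.ini` as before should then match the interactive job only to other hosts.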