belforte closed 3 weeks ago
The problem with ServerTime is very strange: when queried with condor_q it is always defined
belforte@vocms0137/crab> condor_q -con 'jobuniverse==5&&jobstatus==1' -af servertime|uniq
1717682834
belforte@vocms0137/crab>
This seems to be a problem with the Python bindings. There was a bug in HTCondor that made ServerTime go missing; it was fixed in 10.0.2: https://htcondor.readthedocs.io/en/v10_0/version-history/lts-versions-10-0.html#version-10-0-2
yet...
[crabtw@vocms0137 ~]$ python3
Python 3.9.18 (main, Jan 24 2024, 00:00:00)
[GCC 11.4.1 20231218 (Red Hat 11.4.1-3)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import htcondor
>>> htcondor.version()
'$CondorVersion: 10.2.0 2023-01-05 BuildID: 621409 PackageID: 10.2.0-1 $'
>>> constraint="jobuniverse==5"
>>> projection=['ServerTime']
>>> sk=htcondor.Schedd()
>>> rr=sk.query(constraint, projection)
>>> rr[0]
[ ]
>>>
while from my account, where I have more recent bindings in my local
belforte@vocms0137/~> python3
Python 3.9.18 (main, Jan 24 2024, 00:00:00)
[GCC 11.4.1 20231218 (Red Hat 11.4.1-3)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import htcondor
>>> htcondor.version()
'$CondorVersion: 23.7.2 2024-05-16 BuildID: UW_Python_Wheel_Build $'
>>> constraint="jobuniverse==5"
>>> projection=['ServerTime']
>>> sk=htcondor.Schedd()
>>> rr=sk.query(constraint, projection)
>>> rr[0]
[ ServerTime = 1717687105 ]
>>>
looks like 10.2.0 is some separate branch (non LTS) which did not get the fix.
Of course we could use time.time() instead: since the script runs on the same machine as the scheduler, there is no reason to ask the scheduler what the time is!
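A minimal sketch of that fallback, assuming the script only needs an integer Unix timestamp. The helper name `current_time` is hypothetical (it is not from the existing code), and the ad is assumed to be anything with a dict-style `.get()`, such as one element of the query result above:

```python
import time


def current_time(job_ad=None):
    """Return ServerTime from the ad if present, else the local clock.

    job_ad: a dict-like object with .get() (assumption), e.g. one ad
    returned by the schedd query. With bindings affected by the bug the
    ad comes back empty, so we fall back to time.time().
    """
    if job_ad is not None:
        server_time = job_ad.get("ServerTime")
        if server_time is not None:
            return int(server_time)
    # Script and schedd run on the same host, so the local clock is
    # equivalent for this purpose.
    return int(time.time())
```

With something like this the script keeps working on both the old and the new bindings, and the ServerTime projection could be dropped altogether later.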
It has not been working for a while.
I think it was broken by me in March when I removed the creation of /etc/condor/config.d/90_jobrouter.config in puppet: https://gitlab.cern.ch/ai/it-puppet-hostgroup-vocmsglidein/-/commit/2a3422ba2d3b8101414bc9606d67331bce3d4b94
So even if we have
enable_overflow: true
in the data/fqdns/vocms*.yaml files, the condor config variable JAT_ENABLE_OVERFLOW is not set, and the JobAutoTuner log has
More changes are needed, because when I set it to True via 99_local_tweaks, the log is
So two action items, and no quick solution: