-
WmCore retries jobs multiple times to take care of temporal issues within the system.
I can't find any monitoring that tells me on average how many times a job gets resubmitted. The vast majority of…
-
**Impact of the bug**
MSOutput
**Describe the bug**
A change in Rucio caused tape output rule creation to fail and we missed this for 3 weeks causing 7 PB of tape transfers to pile up.
**How …
-
**Tests started for following configuration:**
**Configuration:**
- CRABClient_version: **prod**
- REST_Instance: **prod**
- CMSSW_release: **CMSSW_13_0_2**
- SCRAM_ARCH: **el8_amd64_gcc11**
**Te…
-
in CRABRestInerface the output of curl is piped to stderr,
this makes the code wrongly assume that the call to serve failed if the output contains a string reporting an HTTP error, like when task sub…
-
with ref. to
https://github.com/bbockelm/cms-htcondor-es/blob/master/src/htcondor_es/convert_to_json.py#L532
there are now more DAGs around in for CRAB due to automatic splitting which have task nam…
-
it is very simple to insert a block via a curl command
https://github.com/dmwm/dbs2go/blob/master/docs/DBSWriter.md
so in case client API fail (see e.g. #7310 ) it is worth to retry via curl
-
modify K8s secrets so that crabserver's config.py abd CRABServerAuth.py
are outside the secrets and the latter allows multiple DB instances, like already
done in VM installation [1].
I.e. keep in t…
-
We hit a couple of times the problem where HC appears to stop running at several sites.
This is because when `schedd` restarts and DAGMAN is restarted, the current code does not cope with it graceful…
-
**Impact of the bug**
Current workqueue logs lack of timestamp for each entry in the log. When we need to debug some workflows issues, e.g. https://github.com/dmwm/WMCore/issues/11784, it would be ex…
-
**Impact of the new feature**
Deployment process of all WMCore Central Services
**Is your feature request related to a problem? Please describe.**
In the light of the work related to giving up th…