Open belforte opened 2 weeks ago
Need also to prepare a knowledge base with strings to look-for/avoid in log, like
A list is accessing an object (0x154e2060ea40) already deleted (list name = TList)
which is not a corrupted file !
as a start, skip HammerCloud and limit to max 30 reports per task
see https://its.cern.ch/jira/browse/CMSTRANSF-1024
should modify https://github.com/dmwm/CRABServer/blob/97f447747265684589ac1f5be773eed80de02239/src/python/TaskWorker/Actions/RetryJob.py#L407 so that suspicious file replicas are reported to Rucio