Closed novicecpp closed 7 months ago
very good !! thanks.
Worth checking with Rahul that thing with Rucio@DESY are now stable
Rahul told me it got fix from this change https://github.com/dmwm/CMSRucio/pull/766/files I still do not understand. Maybe @dynamic-entropy can elaborate more?
The rucio script that parsed the tfc rules did not add a trailing slash in prefeix /store/temp/rucio
and store/test/rucio
for _Temp
and _Test
rses respectively.
This change ensures that.
This was an issue with configuring RSEs for tokens because of this, and Guy identified this issue.
Thanks man!
There are 2 issues:
This break the hardcode I put in
RegisterReplicas.py
.https://github.com/dmwm/CRABServer/blob/5f3ab24a9f3544995d920610b84405d2d2b4dc3a/src/python/ASO/Rucio/Actions/RegisterReplicas.py#L303-L304
Good things is now DESY normal RSE and Temp RSE protocol consistent (FYI: We get
pfn
path fromrucio.lfns2pfns()
by passing normal RSE to function and assumedTemp
area has same protocol).CURRENT result from:
rucio.get_protocols('T2_DE_DESY_Temp')
(only davs):LAST YEAR result from:
rucio.get_protocols('T2_DE_DESY_Temp')
(only davs):T2_DE_DESY does not change (paste it here for comparison)
Remove the hardcode should fix.
I expected Rucio ASO to survive exception cause by 1, mark job as fail, then let retry mechanism do the rest. However, it not, and block indefinitely. This is due different exception is raised,
RSEFileNameNotSupported
, as shown in rucio_transfer.logThe line that catch exception:
https://github.com/dmwm/CRABServer/blob/5f3ab24a9f3544995d920610b84405d2d2b4dc3a/src/python/ASO/Rucio/Actions/RegisterReplicas.py#L180
It is safe to use just generic RucioException here, and improve the monitoring later to cache this type of error.