Open marianne013 opened 2 weeks ago
Isn't this solved by the last DIRACOS release?
Why did it show up in the hackathon then ?
The last release was created only yesterday: https://github.com/DIRACGrid/DIRACOS2/releases/tag/2.42
Keep the ticket open until the workshop when we do another hackathon ?
During the hackathon pilot jobs at RAL-LCG2 kept failing. I was not able to retrieve the logs of the failed jobs, but from the running jobs I managed to retrieve the following excerpts: pilot.log
pilot.error
We've seen the same issue on our production instance, and we are working around it by getting the pilot off cvmfs. Simon thinks this might be related to: https://github.com/mamba-org/mamba/issues/2501 Note that this behaviour several hundred jobs per hour that then fail, and that this is how my DN got banned at RAL before. (Hence killing all user jobs targeting RAL before leaving the hackthon is a necessity.)