Closed mommsen closed 6 years ago
as discussed with @gladky , we will implement this by keeping track of the timestamp when the RunOngoing
last went from satisfied() = false
to satisfied() = true
in the HltCpuLoad
module.
@mommsen is it ok to introduce a generic holdoff period since the beginning of the run or do you think we should apply the holdoff just for stable beams ?
I think it is good enough to have a hold-off of 1 minute in any case. A short spike in CPU load does not cause any issues.
Thanks @mommsen for confirming.
One minute on the other hand might be optimistic, I've looked at a few recent cases:
RunOngoing
startsRunOngoing
startsBut this is something which one can easily tune later.
added to release 2.10.6
with @mommsen we agreed to use expert.logic.hlt.cpu.load.holdoff.period = 180000
(3 minutes) in production
(added to the production configuration file but commented out for the moment)
Reopening to include @hsakulin suggestion:
Then we need two hold-offs:
E.g. 5 mins from the start of the run And 2 minutes from the start of the condition.
Fixed in 0914f77, released as 2.10.8
There have been several reports that the DAQExpert warns about a too high CPU load, e.g.:
It is indeed expected that the CPU load is high at the start of the run during stable beams. I would suggest to add a hold-off of 1 minute before reporting the high CPU load.
Remi