cmsdaq / DAQExpert

New expert system processing data model produced by DAQAggregator

new logic module for detecting high deadtime caused by DAQ itself #138

Closed andreh12 closed 6 years ago

andreh12 commented 6 years ago

sequence of tests should be:

check dead time. If high, continue
        v
check backpressure @ FRL/FEROL/FEROL40. If high, continue
        v
check DAQ backpressure at the RU (an RU has non-zero fragments but zero requests from the EVM). If true, continue:
        v
check if EVM resources < 100. If true, continue
        v
look for BUs which are not enabled. If there are, continue:
        v
check CPU load on filter farm

Typically, the first module in the above list for which the condition is not satisfied causes the problems seen before it in the chain, and is most likely the culprit.
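
To make the chain concrete, here is a minimal Python sketch of the sequence above. It is only an illustration and not DAQExpert code: the snapshot field names, the check names, and every threshold except the EVM resources limit of 100 are assumptions made for the example. It walks the checks in order and reports the first one whose condition is not satisfied.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A snapshot is a flat mapping of hypothetical metric names to values.
Snapshot = Dict[str, float]
Check = Tuple[str, Callable[[Snapshot], bool]]

# Hypothetical thresholds, chosen only for illustration.
DEADTIME_PCT_THRESHOLD = 5.0
BACKPRESSURE_PCT_THRESHOLD = 2.0
CPU_LOAD_PCT_THRESHOLD = 90.0
EVM_RESOURCES_LIMIT = 100  # this value is the one stated in the issue

# The checks, in the order listed in the issue.
CHECKS: List[Check] = [
    ("deadtime", lambda s: s["deadtime_pct"] > DEADTIME_PCT_THRESHOLD),
    ("frl_backpressure",
     lambda s: s["frl_backpressure_pct"] > BACKPRESSURE_PCT_THRESHOLD),
    ("ru_backpressure",
     lambda s: s["ru_fragments"] > 0 and s["ru_evm_requests"] == 0),
    ("evm_resources", lambda s: s["evm_resources"] < EVM_RESOURCES_LIMIT),
    ("bus_not_enabled", lambda s: s["bus_not_enabled"] > 0),
    ("filter_farm_cpu",
     lambda s: s["ff_cpu_load_pct"] > CPU_LOAD_PCT_THRESHOLD),
]


def first_unsatisfied(snapshot: Snapshot) -> Optional[str]:
    """Return the name of the first check whose condition is not satisfied.

    Returns None when every condition in the chain holds, including the
    final CPU load check.
    """
    for name, condition in CHECKS:
        if not condition(snapshot):
            return name
    return None


example = {
    "deadtime_pct": 12.0,         # high, continue
    "frl_backpressure_pct": 8.0,  # high, continue
    "ru_fragments": 40,           # fragments queued at the RU ...
    "ru_evm_requests": 0,         # ... but no requests from the EVM, continue
    "evm_resources": 150,         # not below 100: the chain stops here
    "bus_not_enabled": 0,
    "ff_cpu_load_pct": 35.0,
}
print(first_unsatisfied(example))  # prints "evm_resources"
```

If the very first check is the one returned, dead time is not high and there is nothing to diagnose. Keeping the checks in a plain ordered list keeps the order easy to change; the messages shown for each case would be attached separately.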

We need to discuss the messages which should be shown in each case.

andreh12 commented 6 years ago

superseded by #141