cmsdaq / DAQExpert

New expert system processing data model produced by DAQAggregator

Test cases: verification after merging recent features #82

Open gladky opened 7 years ago

gladky commented 7 years ago

I'm merging a few features into the integration branch (fed-hierarchy, new backpressure analysis) and I need your help to verify some test cases. Please confirm that the output of DAQExpert is correct for the following cases:

Case 1

http://daq-expert-dev.cms/daq2view-react/index.html?setup=testbed&time=2017-06-09-17:21:56

Are both correct?

Case 2

http://daq-expert.cms/daq2view-react/index.html?setup=cdaq&time=2017-06-22-04:01:25

Is ferol stuck correct?

Case 3

http://daq-expert.cms/daq2view-react/index.html?setup=cdaq&time=2017-06-14-15:56:04

Partition-problem and corrupted-data-received seem fine. But rus-failed seems redundant for this case.

mommsen commented 7 years ago

Case 1: the real reason is in the 2nd bullet. I guess the 1st one shows up because we do not see the backpressure from DAQ on the BPIX FEDs.

Case 2: this issue has been discussed in detail in the email thread "Fwd: ELOG : DAQ : Dump of FEROL40 with FED Id [1232 and 6 more] when blocking the run" on June 22/23. I think the conclusion is that we do not know if the FEROL40 was indeed stuck for a couple of seconds, or if there was a monitoring hiccup or anything else. I would keep the message as is for now and see if we can find another case.

Case 3: The message is indeed redundant. However, the RU error message gives the details about the corruption. Is it possible to add the error message to the 'Corrupted data received' text?

mommsen commented 7 years ago

BTW: if you want to make the RU/EVM/BU error messages more readable, one could strip anything outside of the quotation marks, i.e. get rid of the 'Caught exception: exception::DataCorruption' and ' raised at ...' stuff.
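A minimal sketch of that idea, assuming the useful part of the message sits between a pair of matching quotation marks. The function name `extract_quoted` and the sample message in the usage note are hypothetical, and the real RU/EVM/BU message layout may differ:

```python
import re

# Matches from the first quotation mark (single or double) to the LAST
# matching one, so quotes nested inside the message are preserved.
_QUOTED = re.compile(r"""(['"])(.*)\1""", re.DOTALL)


def extract_quoted(message: str) -> str:
    """Return only the quoted part of an error message.

    Falls back to the whole (stripped) message when no quoted part is
    found, so nothing is lost for messages in an unexpected format.
    """
    match = _QUOTED.search(message)
    if match is None:
        return message.strip()
    return match.group(2).strip()
```

For a made-up message such as `Caught exception: exception::DataCorruption 'bad payload' raised at somewhere`, this would keep only `bad payload`. Falling back to the unmodified text is a deliberate choice: a shifter should still see something if the format changes.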

gladky commented 7 years ago

Case 1: So we can keep both and configure the usefulness parameter to deliver the best suggestion to the shifter. Right now Ru-failed has lower usefulness than all Known-failures. In this case Fed-stuck will be the primary suggestion. Shall we switch the levels of usefulness? I opened a separate ticket to follow this: #85. If you have suggestions on which problems should be considered more important, please add them.
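To illustrate what switching the levels would change, here is a rough sketch of picking the primary suggestion by usefulness. The `Condition` class, the `primary_suggestion` function and the numeric values are hypothetical and are not taken from the DAQExpert code:

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Condition:
    name: str
    usefulness: int  # hypothetical scale: higher means shown first


def primary_suggestion(active: Iterable[Condition]) -> Optional[Condition]:
    """Return the active condition with the highest usefulness, if any."""
    candidates = list(active)
    if not candidates:
        return None
    # max() keeps the first of several equally useful conditions.
    return max(candidates, key=lambda c: c.usefulness)


# Current ordering as described above: fed-stuck outranks ru-failed.
current = [Condition("fed-stuck", 10), Condition("ru-failed", 5)]
# Proposed ordering: the two levels are swapped.
switched = [Condition("fed-stuck", 5), Condition("ru-failed", 10)]
```

With the `current` values the shifter would see fed-stuck first; with `switched`, ru-failed becomes the primary suggestion while fed-stuck stays available as a secondary one.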

gladky commented 7 years ago

Case 1: Shall we switch the levels of usefulness? Is it correct to say that ru-failed will always be more important and more accurate than fed-stuck?