Open missirol opened 1 year ago
A new Issue was created by @missirol Marino Missiroli.
@Dr15Jones, @perrotta, @dpiparo, @rappoccio, @makortel, @smuzaffar can you please review it and eventually sign/assign? Thanks.
cms-bot commands are listed here
assign reconstruction, dqm
- DQM differences are reported for several wfs, listed in [1] and [2]. All the wfs in [1] reported differences only in the DQM folder named
Tracking
, e.g.
FYI @cms-sw/tracking-pog-l2
New categories assigned: dqm,reconstruction
@tjavaid,@micsucmed,@nothingface0,@rvenditti,@emanueleusai,@syuvivida,@clacaputo,@mandrenguyen,@pmandrik you have been requested to review this Pull request/Issue and eventually sign? Thanks
type tracking
Occurred in https://github.com/cms-sw/cmssw/pull/42517#issuecomment-1671956044 between
Intel(R) Xeon(R) CPU E5-2683 v4
(Broadwell) Intel(R) Xeon(R) Silver 4216 CPU
(Cascade lake)Occurred in https://github.com/cms-sw/cmssw/pull/42506#issuecomment-1673099506 between
Intel(R) Xeon(R) CPU E5-2650 v4
(Broadwell)Intel(R) Xeon(R) Gold 5218 CPU
(Cascade lake)Another example in https://github.com/cms-sw/cmssw/pull/42562#issuecomment-1681201183 :
Intel(R) Xeon(R) CPU E5-2683 v4
(Broadwell)Intel(R) Xeon(R) Gold 5218 CPU
(Cascade lake)Another example in https://github.com/cms-sw/cmssw/pull/42610#issuecomment-1685128180 :
Intel(R) Xeon(R) CPU E5-2683 v4
(Broadwell)Intel(R) Xeon(R) Silver 4216 CPU
(Cascade lake)Another example in https://github.com/cms-sw/cmssw/pull/42622#issuecomment-1688082001 :
Intel(R) Xeon(R) CPU E5-2683 v4
(Broadwell)Intel(R) Xeon(R) Silver 4216 CPU
(Cascade lake)Another example in https://github.com/cms-sw/cmssw/pull/42707#issuecomment-1703882846 :
Intel(R) Xeon(R) Silver 4216 CPU
(Cascade lake)Intel(R) Xeon(R) CPU E5-2683 v4
(Broadwell)This happens when IB baseline
and PR relvals
are generated on two different sets of build nodes ( e.g. Openstack based VMs and HTCondor based batch nodes). For now I have updated jenkins jobs to not use HTCondor nodes for baseline/PR relvals. I hope this will reduce the frequency of these DQM differences
The PR tests in https://github.com/cms-sw/cmssw/pull/42497#issuecomment-1670609280 showed unexpected differences in DQM comparisons of physics quantities.
#42497 is (if done correctly) merely a technical update with zero impact on physics outputs ("no changes expected").
The DQM differences are present, but barely visible, possibly compatible with numerical differences in floating-point computations.
DQM differences are reported for several wfs, listed in [1] and [2]. All the wfs in [1] reported differences only in the DQM folder named
Tracking
, e.g.The differences in [2] mentioned the DQM folders
JetMET
andBtag
. The JetMET differences are reminiscent of #39754. Most differences (if not all) seem related to MVA/DNN discriminators (to be confirmed by domain experts).The same PR tests also reported the following message (unexpectedly, based on the PR itself).
This seems to be mostly due to the following message in the post-PR logs (see here)
@makortel noted in https://github.com/cms-sw/cmssw/pull/42497#issuecomment-1671329990 that
cms-sw/cmsdist#8565 was integrated in
CMSSW_13_3_X_2023-08-01-2300
, and it updated Tensorflow to2.12.0
(it was2.6.4
inCMSSW_13_3_X_2023-08-01-1100
).[1]
[2]