Closed afernez closed 3 years ago
Phoebe suggested that the peaks were due to the truth-matching failing (the one at q2 ~ 4 GeV^2 would be the D* truth-matching succeeding, but the B failing), and Alex confirmed by applying abs(b_TRUEID)=511 && abs(dst_TRUEID)=413
Just to provide plots for this point that the outlier bins were due to truth-matching failing, here are plots of q2 for runs 1 and 2, with shared events from the no reco tree in black, events from the reco tree in grey, and reco events with truth-matching applied for the various particles in the decay in the various colors:
Notable things:
As observed in slides 4, 5 of this presentation, in the true q2 histograms for our run1 and run2 Bd2DstMuNu MC, there seem to be 2 bins that, for the reconstructed tree wrt the non-reconstructed tree, are overfilled for an unknown reason. The "outlier" bins in question are near q2=0,4:
I don't think I've identified the "cause" of these outliers, but there are a few points I've investigated that I wanted to highlight.
.dst
files for the decay)..dst
files, and I apply "compensating" cuts when making the plots to try to account for any differences between the production of the ntuples (the main differences between Phoebe and our run 1 ntuples are due to the different reco scripts and, relatedly, DaVinci versions). As of right now, these yields (found in the presentation) are relatively close (19.7% for Phoebe vs 18.0% for us), but out of curiosity I plotted the true q2 distribution for Phoebe's ntuple vs ours, to see if it had similar outlier bins (Phoebe didn't have a non-reconstructed tree, like we do in our ntuples, so I'm comparing reconstructed events), and it seems that the remaining excess events in Phoebe's ntuple wrt ours, after I apply compensating cuts, seem to be primarily in the two outlier true q2 bins (about 7000 out of about 10000, to be precise). That is, indeed Phoebe's ntuple also seems to have these strange outlier bins (not confirmed outliers in her case, since there's no non-reconstructed tree to compare to, but I suspect these bins would be outliers for her like they are for us), and in fact hers are much more pronounced than ours. As a note for the first plot after this point: all candidates are included for each event. Related to point 1 above, the second plot following this point shows the number of candidates in each event, and it appears Phoebe has many more multiple candidate events than us (and even some events with 3 or more candidates). It would be interesting to see if Phoebe's outlier excess in the first plot became significantly reduced if only one candidate were kept per event (as a side note, I didn't produce this plot for now because I'm not sure how to keep track of event numbers when usingplot_scripts
, which is why I wrote my own macro to make plots with shared events between trees/with only one candidate per event, but also I haven't implemented the compensating cuts inside my macro; of course, I could address one of these two issues and make the plot, if there's sufficient interest). My expectation is that the outliers would be reduced, along with her yield becoming more similar to ours.Final thoughts: I'm fairly confident in the correlation between multiple candidates and the excesses in the outlier bins, but I'm not sure what this correlation could possibly imply. Perhaps true q2=0 events are more likely to be reconstructed with multiple candidates (but why?), but why would q2=4 also be special? Even more, I note in point 1 that I don't believe multiple candidate events can account for all of the excess in the outlier bins, so even if this correlation were explained, where is the other excess originating from? We can discuss this in the meeting Tuesday, but for now just FYI @manuelfs