umd-lhcb / lhcb-ntuples-gen

ntuples generation with DaVinci and in-house offline components
BSD 2-Clause "Simplified" License
1 stars 0 forks source link

Differences between data and MC cut #86

Closed yipengsun closed 2 years ago

yipengsun commented 3 years ago

Here I'll document the differences between data and MC cuts and my implementations. Some of the cuts requires #83 to be finished first.

The general idea is: We DONT require truth-matching, except for Mu (because we have a dedicated Mu mis-ID samples to study the effects of misID).

cut name data MC
K PID PIDK > 4 & !isMuon apply PID weights
Pi PID PIDK < 2 & !isMuon apply PID weights
Mu PID isMuon & PIDmu > 2 & PIDe < 1 & BDTmu > 0.25 has a true Mu and apply official PID weights and uBDT weights (The true Mu requirement is enforced at truth-matching already)
yipengsun commented 3 years ago

I separated the UBDT cuts from the regular PID cuts, because candidates that fail these cuts are still used as some control samples.

yipengsun commented 3 years ago

For run 2, 95% candidates that pass the regular selection cuts also pass the UBDT cut. This is true for both D0 and D* trees.

yipengsun commented 3 years ago

Phoebe has:

I don't see truth-matching there.

yipengsun commented 3 years ago

We confirmed with Pheobe that for K, Pi, we don't require the particle to be truth-matched. This is because we don't have a separate misID sample.

yipengsun commented 2 years ago

Here I document the main difference between the data and MC cuts, as shown in our run 2 postprocessing YAML:

FYI @manuelfs @Svende @afernez

manuelfs commented 2 years ago

This is very helpful, thank you very much Yipeng 🙏