umd-lhcb / MuonBDTPid

Muon PID with a uboost BDT (in ROOT 5). Also include code for PID efficiency studies
0 stars 0 forks source link

Port Greg/Lucia's run 2 UBDT efficiency evaluator #8

Closed yipengsun closed 2 years ago

yipengsun commented 2 years ago

As we are waiting for the official PIDCalib ntuple production to finish, we can first port the UBDT efficiency evaluator so that it works on the small sample PIDCalib ntuple.

Efficiency evaluation procedure

There's my current understanding on the UBDT efficiency evaluation.

We use sPlot for this. sPlot provides a method to unfold the overall distribution of a mixed sample of events into the sub-distributions of the various species.

The procedures are the following:

Does an efficiency evaluator exist already?

We have discussed about this w/ Greg and Lucia ~1 year ago, and Lucia sent us her code:

/afs/cern.ch/work/l/lgrillo/public/PIDCalibTuples  # This is for Castelao, so unrelated
/afs/cern.ch/work/l/lgrillo/public/forMuonID  # This is mostly Greg's run 2 UBDT applier, not the efficiency evaluator
/afs/cern.ch/work/l/lgrillo/public/forMuonID_Run2Update  # Same as above

After looking at these folders, we agreed that the efficiency evaluator is not shared.

yipengsun commented 2 years ago

Here's a slide from Lucia regarding their progress on updating UBDT run 2 efficiency: https://indico.cern.ch/event/824062/contributions/3446283/attachments/1853034/3042705/SLMuonID_v1.pdf

yipengsun commented 2 years ago

@manuelfs @Svende @afernez The third folder is mostly identical to the second. I also checked the slides, and I don't think the plots there used sPlot technique.

So my conclusion is that we don't have their efficiency evaluation code.

manuelfs commented 2 years ago

I quickly checked, and I think you are right: I don't see the efficiency evaluation code

yipengsun commented 2 years ago

@manuelfs @Svende @afernez I've updated the top post to include my current understanding on the whole efficiency evaluation procedure. Please take a look (and also the sPlot paper, if you have time!).

yipengsun commented 2 years ago

Given that there's a branch called probe_sWeight in the official PIDCalib sample, and we'll use PIDCalib sample directly for Mu UBDT study, the sweight part is not needed.

We'll not bother Greg/Lucia anymore on this.