Open arogozhnikov opened 9 years ago
Few more.
sklearn.IsotonicRegression
to estimate s/b quantity. [TODO discuss]hep_ml
).Ok, one more idea based on what you said about 'longest sequence' within a row for online triggering.
It's three-stage recognition, very fast
Anyway, we hardly can directly use information about interaction of wires from different layers due to significant impredictable rotations, so this seems a fine intermediate step.
yeah agreed. I am presenting tomorrow to my supervisors, so focusing on offline until the weekend. I am thinking that "sig_like_neighbours" can preform better.
Can we train all the wires just one non-neighbour features (time, energy deposit, layer id), then use this output to define if a wire is signal like, but only when its considered as a neighbour. Does this make sense?
Also, maybe we can try training in neighbours defined via even layers vs. odd layers? I.E. vertical neighbours of layer 3 is layer 1 and layer 2...
Also, maybe we can try training in neighbours defined via even layers vs. odd layers? I.E. vertical neighbours of layer 3 is layer 1 and layer 2...
Shifts are too huge - that's the problem. Hope, you mean layer1 and layer 5 (not 2)?
One more thing why I don't want significally rely on far points - this information we will take via hough transform.
Yes, I did mean layer 1 and 5, point taken about hough transform. what about:
Can we train all the wires just one non-neighbour features (time, energy deposit, layer id), then use this output to define if a wire is signal like, but only when its considered as a neighbour. Does this make sense?
This makes sense, and is very good model of first order approximation. Something like 'baseline' model.
However, time by itself is senseless, for instance. So, you'll lose this information in the model. All information that can be take from pairwise feature interaction will be lost.
sorry, I should clarify: time as a variable = time of hit - time of trigger. lets call it relative_time from now on (this will be reflected in my next push). This variable is much earlier for signal hits, and flat for background.
the only place the output of this 'baseline' model would be used is in the current 'signal like neighbours' which under performs (if you ask me). check the latest LocalBasedFiltering, it shows the peroformance of the features.
together with other things this should be fine. Better use undertrained classifier there (n_trees=10, min_samples_leaf=100)
Forgot something?