Comparison with CTA-MARS: Energy estimation

This issue is part of a project described in issue #24.

The following is a "real-time" list of points that are found to be differences between the pipelines using the comparison. Not all features are critical to recovering the missing performance, but all should be implemented (as more similar as possible) in order to allow their optional use when comparing different algorithms.

[x] Add the possibility to use a Random Forest

Currently, protopipe uses an Adaptive Boost Regressor based on a Decision Tree, while CTAMARS uses a Random Forest regressor.

[x] Add missing parameters
- [x] Concentration (fraction of the total Intensity which is contained in the two brightest pixels of the cleaned image) - #132

This is not defined in the same way in ctapipe, not sure if we should add it this way, better wait to see if the difference in definition plays a role in the overall performance of the pipeline.

[x] Leakage1 (fraction of total Intensity which is contained in the outermost pixels of the camera)
[x] log10(Width*Length/Size)
[x] square of distance from Image c.o.g. to the reconstructed event direction on the camera (dir_x, dir_y)
[x] atan2(cog_y - dir_y, cog_x - dir_x)

The last 3 features may require a small enhancement in the management of the features read from the configuration file and form the scripts which produce DL2 data (see #90)

[x] Get RMS from each Random Forest regressor.

Right now we get only the estimate but we should get also a measure of the variance from the trees estimates. This is used also in the weighting for the gammaness estimation for the DL2-candidate event.

[x] Even though we can get the RMS out of the trees we are missing a detail

From the wiki page of the CTAMARS analysis,

the RFs provide estimates for log10 E and its RMS, but the average is done after converting those to linear energy scale)

and now issue #139 is breaking this behaviour because this operation is performed on the base-10 logarithm scale!

[x] Modify (better: allow for configurable) weight (#125)

This issue will be useful also for #93. Right now we weigh with Intensity, while CTAMARS uses 1/RMS^2 where RMS comes from each Random Forest regressor.

[x] Recover missing performance below few hundreds of GeVs

As shown here we have lost sensitivity at low energies (mainly due to the necessary changes between 0.3.0 and 0.4.0). Currently, it is not clear if with the previous 2 points this will be solved or it will require to fix/add something in DL1 and/or DL2a.

[x] Modify usage/training (configurable, to check)

CTAMARS uses the whole gamma-1 and samples to train the classification model, whereas protopipe splits the original TRAINING data into train/test sub-samples. This allows applying intermediate benchmarking before applying the models to the rest of the analysis data sample (DL2 production takes more time and it could be convenient to make studies on the models without producing every time DL2 data). In the case of energy estimation, this could represent a minor problem than classification, in fact, the energy estimation benchmarks can be applied (as of 0.4.0) to the gamma-2 sample, which is used to train the classification model.

cta-observatory / protopipe

Comparison with CTA-MARS: Energy estimation #92