Open asalzburger opened 3 years ago
For the second bullet:
Plotted the number of hits for of every particle for a specific event. Made some observations regarding the "most common" number of hits, which is 14. I wanted to see if this holds for most (if not all) the events. So I computed the number of hits of every particle for all the events:
which suggests that our initial thought was correct.
For the third bullet:
I started by plotting all the tracks for one event:
Of course, this does not provide much insight. Let's take the hits for one particle only:
We would like to fit a circle in this track (since ideally the particle follows a circular trajectory). Effectively, we want to learn a radius and a center. Since a function must have only one output (and in the Cartesian circle, any "x coordinate" may correspond to 2 "y coordinates"), we can use a transformation to the polar coordinates system. This will transform the circle into a straight line, which will allow us to use linear regression to fit it. The coordinates in the polar plane will look like this:
Since we only have a few datapoints (on average 14 hits) we should make sure not to overfit the given data. That's why we will use Ridge Regression, a Linear Regression algorithm that uses regularization. The polar coordinates will be fit like so:
which will translate in the following curve in the Cartesian plane:
After having computed this chord of the circle, we can sample any 3 points from it in order to recreate the actual circle. Note that those 3 points are not from the given dataset, they are estimations that the regressor has made. The circle we will find is:
Using the radius we found from the circle, we can compute the estimated transverse momentum and compare it to the ground truth like so:
which yields quite accurate results.
Very nice.
This is really good!!
Regarding Task 2, I added some more plots in the corresponding Notebook.
Total number (aggregate) of hits per eta values in the ranges [0, 0,1], [0.1, 0.2], ...
I wasn't sure is the dataset is imbalanced (i.e. more events with specific eta values), so I plotted also the average number of hits for every eta value in the ranges mentioned above:
They are all close to 14, which looks like to be the true average.
Fixed the eta plots:
I think there is still a small problem with the total number of hits vs track eta: it seems that the bin [0, 0.1] is not filled and that the [-0.5, -0.4] has too many entries...
It's true! I had a small bug when rounding eta. Thanks for pointing it out! I fixed it, now the plots look like this:
In
data/pdg13-n25-0.5to10GeV-0.5eta
there are more complicated input files. The simulation is now done withInfo: curvature, magnetic field, particle mass.