Closed naodell closed 1 year ago
As explicitly written in the README since a while, root_pandas, and root_numpy on which it depends, has been deprecated and effectively unmaintained for quite a while. We decided to close anthing outstanding as "won't do" and archive the package at this point.
This is an awesome tool!
Considering that HEP data typically has variable number of objects and has a natural division of data (an event) it would be nice if this could be taken into account when converting a root file. I think this would be addressed (as suggested in the title) by being able to specify a multi-index when calling read_root. So for instance, I have event-by-event data in a root file and each event has a several vectors which are consistently sized within an event. I would like to specify something like:
df = read_root('my_data.root', columns=['track_pt', 'track_eta', 'track_phi'], index=['event', '__array_index'], flatten=True)
I was able to do this in two steps with just
track_pt
At some level this is a quality of life request, but this did not work when I specified additional variables. Additionally it would be nice if you could somehow read by number of events instead of chunk size though maybe that's tricky to implement.
Thanks, Nate