vepadulano / PyRDF

Python Library for doing ROOT RDataFrame analysis
https://pyrdf.readthedocs.io/en/latest/
9 stars 7 forks source link

Initial support for AsNumpy #78

Closed vepadulano closed 5 years ago

vepadulano commented 5 years ago

AsNumpy pythonization is now supported both locally and distributedly. This initial implementation is functional, but not definitive.

AsNumpy works locally with the 6.18 version of ROOT. In order for it to work distributedly some changes to the PyROOT AsNumpy function have to be made. The changes involve bringing the ndarray class, previously defined inside the function scope, to the global scope. In this way that class would be picklable during the execution of the Spark job.

A few tests have been added, but they will be marked as expected failures until a fix on the PyROOT side will happen. (PR)

The df026_AsNumpyArrays.py tutorial has been added both for the local and the Spark environments.

vepadulano commented 5 years ago

Of course also the tutorials are failing at the moment. Will remove them and add them back at a later stage