The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can be used from pure Python code.
I don't know why we need to do this (not saying there isn't a reason, just that the description is blank so I have no context). LGTM though; will stamp once we discuss.
Codecov Report
93.98% <0.00%> (-0.14%)