Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Apache License 2.0
1.78k
stars
285
forks
source link
Adding instructions on patching pyspark installation with s3 protocol supporting jars #600
Codecov Report
90.24% <0.00%> (ø)
91.75% <0.00%> (ø)
92.70% <0.00%> (ø)
Continue to review full report at Codecov.