uber / petastorm

The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can be used from pure Python code.
Apache License 2.0
1.78k stars, 285 forks

Adding instructions on patching pyspark installation with s3 protocol supporting jars #600

Closed: selitvin closed this pull request 4 years ago

codecov[bot] commented 4 years ago

Codecov Report

Merging #600 into master will not change coverage. The diff coverage is n/a.

@@           Coverage Diff           @@
##           master     #600   +/-   ##
=======================================
  Coverage   85.68%   85.68%           
=======================================
  Files          87       87           
  Lines        4994     4994           
  Branches      795      795           
=======================================
  Hits         4279     4279           
  Misses        578      578           
  Partials      137      137           
Impacted Files                           Coverage Δ
petastorm/reader.py                      90.24% <0.00%> (ø)
petastorm/fs_utils.py                    91.75% <0.00%> (ø)
petastorm/workers_pool/process_pool.py   92.70% <0.00%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data. Last update da48ec8...a553673.