uber / petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Apache License 2.0
1.78k stars 285 forks source link

Move gcsfs library to testing dependencies #586

Closed selitvin closed 4 years ago

selitvin commented 4 years ago

gcsfs is a too heavy dependency to force on a user as is drags in some transient dependencies incompatible with some of our users environemnts.