uber / petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Apache License 2.0
1.8k stars 284 forks source link

Is there support for gs datasets? #468

Open skeller88 opened 4 years ago

skeller88 commented 4 years ago

Similar to https://github.com/uber/petastorm/issues/210

selitvin commented 4 years ago

419 has a link to a fork that attempts to implement such support. We don't have anything official on petastorm master yet...

selitvin commented 4 years ago

472 implements the requested support.Will be releasing a new version that will include the support in the following days...