intel-analytics / analytics-zoo

Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
https://analytics-zoo.readthedocs.io/
Apache License 2.0

orca.data.image.read_parquet: support reading as a PyTorch DataLoader #251

Open yangw1234 opened 3 years ago

yangw1234 commented 3 years ago

API:

orca.data.image.read_parquet(format='tf_dataset|dataloader', input_path=..., transforms=..., config={}, **other_kwargs)

References:
https://github.com/intel-analytics/analytics-zoo/pull/3956
https://github.com/uber/petastorm#pytorch-api

yangw1234 commented 3 years ago

@leonardozcm could you help take a look at this issue?

leonardozcm commented 3 years ago

Of course, I will have a look.