Open georghildebrand opened 5 years ago
Hi. These formats look interesting. We never evaluated their performance but these do surely look promissing. It would be interesting to try. We are playing internally with making Parquet just another backend implementations alternatives for Petastorm. We can consider making bcolz or zarr another backend alternative for Petastorm.
I think for cases from machine learning, algorithmic etc. that would heavily make sense as i currently see parquet mostly for relational data ... maybe i am wrong in this assumption.
And I would like to see the comparison with lmdb.
Hi all, great work and glad to see progress here. Is there anywhere a comparison between petastrom and HDF5/bcolz/zarr ?