Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.
Hey @selitvin! Great to see your work here :) This looks like a super useful project! Also love your presentation. Seems a shame not to have it on github as well. Very best to you!
Hey @selitvin! Great to see your work here :) This looks like a super useful project! Also love your presentation. Seems a shame not to have it on github as well. Very best to you!