Closed gregw18 closed 3 years ago
Do we want to land this PR? If so, can you please rebase since some changes in this PR has landed already.
Whoops - thanks for catching that! I seem to be having some problems with the build, and they don't seem to be related to my code. I'm going to trigger another build Saturday morning to see if it will succeed then.
Added option for using PyTorch for throughput testing, using petastom.pytorch.DataLoader. Had to modify compat.py to return column.data.num_chunks regardless of pyarrow version, as 015 was crashing on column.num_chunks. Added option to pass min_after_retrieve to petastorm.pytorch.DataLoader, to give it similar functionality to TensorFlow. Note that I am just learning Python, so any and all feedback is appreciated!