Closed rom1504 closed 2 years ago
works but reading parquet from s3 is quite slow when the metadata is strings. Reading from local is much much faster I believe the reason is that the .slice of pyarrow on parquet is not doing anything very useful and is reading much more than it should maybe the only fast solution for parquet would be to have a local cache
also stop the readers on error
15