This is the final set of fixes identified in issue #86. CI works locally for me now.
The changes are all related to API changes and deprecations in pyarrow between 5.0.0 and 8.0.0. Changes were required in our code to directly deal with these as well as other changes following dask modifications to deal with the same.
I have created a new script _create_testdata.py to create test parquet files that are stored in the new tests/test_data directory and these are checked as part of pytest. The last time the CI definitely worked was July 2021 with pyarrow==5.0.0 and dask==2021.7.2 (and the same for distributed). These files are successfully read with up-to-date pyarrow==8.0.0 and dask==2022.7.1. Similarly, test parquet files create with the up-to-date pyarrow and dask are successfully read with pyarrow==5.0.0 and dask==2021.7.2.
This is the final set of fixes identified in issue #86. CI works locally for me now.
The changes are all related to API changes and deprecations in pyarrow between 5.0.0 and 8.0.0. Changes were required in our code to directly deal with these as well as other changes following dask modifications to deal with the same.
I have created a new script
_create_testdata.py
to create test parquet files that are stored in the newtests/test_data
directory and these are checked as part ofpytest
. The last time the CI definitely worked was July 2021 withpyarrow==5.0.0
anddask==2021.7.2
(and the same fordistributed
). These files are successfully read with up-to-datepyarrow==8.0.0
anddask==2022.7.1
. Similarly, test parquet files create with the up-to-datepyarrow
anddask
are successfully read withpyarrow==5.0.0
anddask==2021.7.2
.