IntelPython / sdc

Numba extension for compiling Pandas data frames, Intel® Scalable Dataframe Compiler
https://intelpython.github.io/sdc-doc/
BSD 2-Clause "Simplified" License
646 stars 62 forks source link

Migrating to pyarrow=4.0.1 #982

Closed kozlov-alexey closed 3 years ago

kozlov-alexey commented 3 years ago

Motivation: keep up with the latest versions of dependencies

kozlov-alexey commented 3 years ago

/AzurePipelines run

azure-pipelines[bot] commented 3 years ago
Azure Pipelines successfully started running 1 pipeline(s).
kozlov-alexey commented 3 years ago

The change in pandas_read_csv is adaptation for https://github.com/apache/arrow/pull/9203 that changed date-inferred type from TimestampType to DateType and caused sdc.tests.test_io.TestCSV.test_pyarrow (test='csv_date1') to fail since to_pandas converts it back to strings (unless date_as_object=False is used).