Closed galipremsagar closed 2 years ago
Our conda environment pins pandas>=1.0,<1.4.0dev0.
Are we moving to or beyond pandas 1.4.0 in cudf 22.06?
Relevant pandas commit https://github.com/pandas-dev/pandas/commit/7f06a8a577d85617d08570ef328851161a4ddf35
Describe the bug
Starting with pandas 1.4.0, there is a change in the behavior of mangle_dupe_cols in cudf.read_csv. The change in behavior is explained well here: https://pandas.pydata.org/docs/whatsnew/v1.4.0.html#mangle-dupe-cols-in-read-csv-no-longer-renames-unique-columns-conflicting-with-target-names
Steps/Code to reproduce bug
Expected behavior
Environment overview (please complete the following information)
Environment details
Please run and paste the output of the cudf/print_env.sh script here, to gather any other relevant environment details.
Additional context
We have test coverage for this, but it is currently disabled: https://github.com/rapidsai/cudf/blob/fb03c8bc91e6a9de0605ff727da45e183f0fd5b5/python/cudf/cudf/tests/test_csv.py#L1278-L1286
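For anyone picking this up, here is a rough pure-Python sketch of the renaming rule as I understand it from the pandas 1.4.0 release notes linked above: a duplicated column gets the next numeric suffix that is not already used by any label in the header, and unique labels are left alone. The helper name mangle_dupe_cols_sketch is hypothetical and made up for illustration; this is not the pandas or cudf implementation, only a model of the expected column names.

```python
def mangle_dupe_cols_sketch(names):
    """Hypothetical helper: deduplicate header labels, skipping suffixes
    that would collide with a label already present in the header."""
    taken = set(names)  # every original label is reserved up front
    counts = {}         # next suffix to try for each base label
    result = []
    for name in names:
        seen = counts.get(name, 0)
        if seen == 0:
            # First occurrence keeps its label unchanged.
            new_name = name
        else:
            # Later occurrences get "<name>.<n>", skipping taken labels.
            new_name = f"{name}.{seen}"
            while new_name in taken:
                seen += 1
                new_name = f"{name}.{seen}"
            taken.add(new_name)
        counts[name] = seen + 1
        result.append(new_name)
    return result


print(mangle_dupe_cols_sketch(["a", "a", "a.1"]))
```

Under this rule the unique column a.1 keeps its name and the second a becomes a.2, which is the behavior the disabled test would need cudf.read_csv to match.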