Closed galipremsagar closed 2 years ago
is this the only failing test?
Yes, these are all the combinations that are failing with same error:
FAILED python/cudf/cudf/tests/test_parquet.py::test_parquet_reader_basic[brotli-10000-columns0-cudf]
FAILED python/cudf/cudf/tests/test_parquet.py::test_parquet_reader_basic[brotli-10000-columns1-cudf]
FAILED python/cudf/cudf/tests/test_parquet.py::test_parquet_reader_basic[brotli-10000-columns2-cudf]
FAILED python/cudf/cudf/tests/test_parquet.py::test_parquet_reader_basic[brotli-10000-columns3-cudf]
FAILED python/cudf/cudf/tests/test_parquet.py::test_parquet_reader_basic[brotli-10000-None-cudf]
based on this, it looks specific to brotli compression
does it repro consistently?
does it repro consistently?
Yup, fails on every run consistently.
Describe the bug When
cudf
is built withcuda-11.5
, there seems to be a data-corruption while reading a parquet file withcudf
engine. It seems to be the case for multiple columns, see example below.Steps/Code to reproduce bug
Expected behavior Data should be read correctly.
Environment overview (please complete the following information)
Environment details Please run and paste the output of the
cudf/print_env.sh
script here, to gather any other relevant environment detailsClick here to see environment details
Additional context Add any other context about the problem here.