Closed galipremsagar closed 3 years ago
na_filter
is partially functional - it disables the na_values
from being detected as null/NaN. Looks like what is missing is the change in logic to treat empty fields as null. Good catch.
There may be more that can be done with this parameter, like creating non-nullable output columns.
Describe the bug We have a parameter called
na_filter
which will decide whether there should be any nulls in the dataframe or not while reading a csv file content. This parameter seems to be non-functioning.Steps/Code to reproduce bug
Expected behavior When
na_filter
isFalse
we shouldn't actually be having any nulls in the dataframe and rest of the empty values would be read as empty strings pandas does.Environment overview (please complete the following information)
Environment details Please run and paste the output of the
cudf/print_env.sh
script here, to gather any other relevant environment detailsClick here to see environment details
Additional context Surfaced while running fuzz tests: #6001