Closed lizdeika closed 9 months ago
This is not limited to tab-separated values, the same happens for CSV with commas as well.
Maybe this will help:
Setting missing_utf8_is_empty_string=True
Space(in first column) gets converted to empty string ""
instead of null
Looks like space is not recognized as utf8 char if it is a value of the first column
Similar whitespace-related CSV bugs: https://github.com/pola-rs/polars/issues/10587, https://github.com/pola-rs/polars/issues/12763
Seems I should fallback to pandas
@lizdeika Pull requests are welcome!
Not able to reproduce using polars 0.20.6 anymore; was fixed by: https://github.com/pola-rs/polars/pull/13934
Checks
[X] I have checked that this issue has not already been reported.
[X] I have confirmed this bug exists on the latest version of Polars.
Reproducible example
Log output
Issue description
Simple TSV file that has 4th and last rows' first column value as SPACE character Those spaces get converted to nulls. No problem for columns that are not first.
Expected behavior
Space is Space, not null
Installed versions