jazzband / tablib

Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
https://tablib.readthedocs.io/
MIT License
4.58k stars 589 forks source link

Suggestion: Option to normalize treatment of missing values #512

Open rosensama opened 2 years ago

rosensama commented 2 years ago

A Dataset from an Excel file will store None for any empty cell. In CSV the convention is to not distinguish between ,, and ,"", and treat all empty cells as empty strings. It would be nice to add a feature like that in pandas.read_csv() where ,, is treated as NaN, but None here for symmetry with Excel. I know that toggling the na_filter argument in read_csv() is one way to govern this in pandas.

Anyways, thanks for the useful tool.