apache / arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
https://arrow.apache.org/
Apache License 2.0
14.45k stars 3.52k forks source link

[Python] csv.ConvertOptions Do Not Pass Through/Retain Nullability from Schema #23903

Open asfimport opened 4 years ago

asfimport commented 4 years ago

 

Originally mentioned in: https://github.com/apache/arrow/issues/6243

High level description of the issue:

Minimal reproduction case:

Potential source of issue:

Environment: Reproduced on Ubuntu 18.04 and OSX Catalina in Python 3.7.4. Reporter: Tim Lantz

Note: This issue was originally created as ARROW-7655. Please see the migration documentation for further details.

asfimport commented 4 years ago

Tim Lantz: Re: my side note above, I filed https://issues.apache.org/jira/browse/ARROW-7656 as well. I see that in ARROW-6536 there is discussion on why in the C++ API you need to set both and that makes perfect sense so this is just a documentation thing.

asfimport commented 4 years ago

Joris Van den Bossche / @jorisvandenbossche: Currently, I think the column_types option is only meant to specify the types, while nullability is part of the Field in a Schema, and is not a fundamental property of the type itself.