apache / arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
https://arrow.apache.org/
Apache License 2.0

[C++] csv::TableReader column names, Read() arguments #26221

Open asfimport opened 4 years ago

asfimport commented 4 years ago

Some feature requests:

Reporter: Neal Richardson / @nealrichardson

Note: This issue was originally created as ARROW-10219. Please see the migration documentation for further details.

asfimport commented 4 years ago

Antoine Pitrou / @pitrou: I'm not sure I understand #1. Can you explain a bit more? As for #2, you can already restrict which columns are converted by setting ConvertOptions::include_columns.

asfimport commented 4 years ago

Neal Richardson / @nealrichardson: I didn't know about include_columns, thanks.

Here are two use cases for being able to get the column names without reading the whole table:

asfimport commented 3 years ago

Antoine Pitrou / @pitrou: cc @westonpace

asfimport commented 3 years ago

Weston Pace / @westonpace: It would probably be column_names rather than schema. The table reader can do late inference, so it may not know the final schema until the whole table has been read. But column_names should be pretty straightforward to add.
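
To illustrate why column names can be known before the schema, here is a minimal standard-library sketch that reads only the header row of a CSV stream. It is not Arrow's implementation and does not use the Arrow API. `PeekColumnNames` is a hypothetical helper named for this example, and it assumes a header row is present and that header fields are unquoted.

```cpp
#include <cassert>
#include <cstddef>
#include <istream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper (not part of Arrow): consume only the first line of a
// CSV stream and split it on the delimiter to obtain the column names.
// Simplification: quoted fields that contain the delimiter or a newline are
// not handled.
std::vector<std::string> PeekColumnNames(std::istream& input, char delimiter = ',') {
  // The delimiter must not be a line terminator.
  assert(delimiter != '\n' && delimiter != '\r');

  std::vector<std::string> names;
  std::string header;
  if (!std::getline(input, header)) {
    return names;  // Empty input: there is no header row.
  }
  // Tolerate Windows line endings.
  if (!header.empty() && header.back() == '\r') {
    header.pop_back();
  }
  // Split manually so that a trailing empty field is preserved.
  std::size_t start = 0;
  while (true) {
    const std::size_t end = header.find(delimiter, start);
    if (end == std::string::npos) {
      names.push_back(header.substr(start));
      break;
    }
    names.push_back(header.substr(start, end - start));
    start = end + 1;
  }
  return names;
}
```

For example, passing a `std::istringstream` that holds `"id,name,score\n1,alice,3.5\n"` yields the three names `id`, `name`, and `score`, and leaves the stream positioned at the first data row. Only the names come from the header; column types still depend on the data rows, which matches the point above about late inference.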

asfimport commented 2 years ago

Todd Farmer / @toddfarmer: This issue was last updated over 90 days ago, which may be an indication it is no longer being actively worked. To better reflect the current state, the issue is being unassigned. Please feel free to re-take assignment of the issue if it is being actively worked, or if you plan to start that work soon.