apache / arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
https://arrow.apache.org/
Apache License 2.0

[C++] [Dataset] The CSV file format currently always disables multithreading #29058

Open asfimport opened 3 years ago

asfimport commented 3 years ago

Previously, the threading setting was a no-op for the streaming CSV reader, so always disabling it in the dataset CSV file format was a non-issue. Now that there is a parallel CSV reader, multithreading should be re-enabled.

https://github.com/apache/arrow/blob/81ff679c47754692224f655dab32cc0936bb5f55/cpp/src/arrow/dataset/file_csv.cc#L118
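To make the request concrete, here is a minimal, standalone sketch of the pattern the issue title describes and one possible fix. It is not Arrow's actual code: `CsvReadOptions`, `ScanOptions`, `MakeReadOptionsCurrent`, and `MakeReadOptionsProposed` are hypothetical names invented for illustration, and the content of the linked line is assumed from the title rather than reproduced.

```cpp
// Hypothetical stand-ins, NOT Arrow's real types: a minimal model of the
// behavior described in this issue.
struct CsvReadOptions {
  bool use_threads = true;  // assumed reader default: parallel reading
};

struct ScanOptions {
  bool use_threads = true;  // what the dataset caller asked for
};

// Behavior described by the issue title (assumed): the CSV file format
// ignores the caller's request and always disables multithreading.
inline CsvReadOptions MakeReadOptionsCurrent(const ScanOptions& /*scan*/) {
  CsvReadOptions read_options;
  read_options.use_threads = false;
  return read_options;
}

// One possible fix (an assumption, not the adopted patch): forward the
// caller's threading preference to the CSV reader.
inline CsvReadOptions MakeReadOptionsProposed(const ScanOptions& scan) {
  CsvReadOptions read_options;
  read_options.use_threads = scan.use_threads;
  return read_options;
}
```

With the first function, a scan that requests threads still gets a single-threaded CSV read. With the second, the reader follows whatever the scan requested, which is the re-enabling this issue asks for.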

Reporter: Weston Pace / @westonpace

Note: This issue was originally created as ARROW-13387. Please see the migration documentation for further details.

asfimport commented 2 years ago

Todd Farmer / @toddfarmer: This issue was last updated over 90 days ago, which may be an indication it is no longer being actively worked. To better reflect the current state, the issue is being unassigned. Please feel free to re-take assignment of the issue if it is being actively worked, or if you plan to start that work soon.