apache / arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
https://arrow.apache.org/
Apache License 2.0
14.36k stars 3.49k forks source link

[Python][Documentation] Add column limit recommendations Parquet page #24897

Open asfimport opened 4 years ago

asfimport commented 4 years ago

Users would be well advised to not write columns with large numbers (> 1000) of columns

Reporter: Wes McKinney / @wesm

Note: This issue was originally created as ARROW-8746. Please see the migration documentation for further details.

asfimport commented 4 years ago

Maarten Ballintijn / @maartenb: @wesm, you've mentioned this before and as this is a not uncommon use-case can you maybe expand a bit on the following related questions. (use-cases include daily or minute data for a few 10's of thousands items like stocks or other financial instruments, IoT sensors, etc).

asfimport commented 4 years ago

Wes McKinney / @wesm: I appreciate the interest but perhaps let's discuss on the Parquet dev mailing list where more people can offer their advice and recommendations.

asfimport commented 4 years ago

Maarten Ballintijn / @maartenb: Redirected as requested.