apache / arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
https://arrow.apache.org/
Apache License 2.0

[C++][Parquet] Parquet Arrow: Better interface for coerce_types handling #43810

Open mapleFU opened 2 weeks ago

mapleFU commented 2 weeks ago

Describe the enhancement requested

Currently, we have multiple string types, plus dictionary encoding.

When reading from Parquet, we may want an Arrow type that differs from the storage type, so the reader needs a way to accept a coerce type to apply on read.
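
To make the request more concrete, here is a rough sketch of the kind of per-column lookup such an interface might perform. Everything in it is hypothetical: `StringKind`, `ReadCoerceOptions`, and `ResolveReadType` are illustrative names that do not exist in Arrow or Parquet C++, and the enum values are only examples of types a reader could target. It uses the standard library only and is not a proposal for the actual API.

```cpp
#include <map>
#include <string>

// Hypothetical: illustrative target types, not an Arrow enum.
enum class StringKind { kString, kLargeString, kStringView, kDictionary };

// Hypothetical: per-column read-time coercion requests, keyed by column name.
struct ReadCoerceOptions {
  std::map<std::string, StringKind> coerce_types;
};

// Returns the type the reader would produce for a column: the requested
// coerce type if one is registered, otherwise the type implied by storage.
inline StringKind ResolveReadType(const ReadCoerceOptions& options,
                                  const std::string& column,
                                  StringKind storage_type) {
  auto it = options.coerce_types.find(column);
  return it == options.coerce_types.end() ? storage_type : it->second;
}
```

With this shape, a column that has no entry keeps its storage type, so existing readers would be unaffected unless a caller opts in for a specific column.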

Component(s)

C++, Parquet

mapleFU commented 2 weeks ago

Maybe related: https://github.com/apache/arrow/issues/43660