apache / arrow

Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
https://arrow.apache.org/
Apache License 2.0
14.37k stars 3.49k forks source link

[C++][Parquet] Add JSON canonical extension type #32538

Closed asfimport closed 3 weeks ago

asfimport commented 2 years ago

As a BigQuery developer, I would like the Arrow libraries to support the JSON logical Type. This would enable us to use the JSON type in the Arrow format of our ReadAPI. This would also enable us to use the JSON type to export data from BigQuery to Parquet.

Reporter: Pradeep Gollakota Watchers: Rok Mihevc / @rok

Related issues:

Note: This issue was originally created as ARROW-17255. Please see the migration documentation for further details.

asfimport commented 2 years ago

David Li / @lidavidm: Hey - I made a guess at the components, but you may want to follow up on the mailing list (dev@arrow.apache.org) with some more details (e.g. what languages you want to support, at least initially, and any capabilities such an extension type would have, beyond just wrapping a string). There have been other such discussions on 'common' extension types like UUIDs.

asfimport commented 2 years ago

Rok Mihevc / @rok: This is one of the threads: https://lists.apache.org/thread/3nls3222ggnxlrp0s46rxrcmgbyhgn8t

pitrou commented 3 weeks ago

Issue resolved by pull request 13901 https://github.com/apache/arrow/pull/13901