datafusion-contrib / datafusion-orc

Implementation of Apache ORC file format using Apache Arrow in-memory format
Apache License 2.0
30 stars, 8 forks

Support non-struct root data type #47

Open Jefffrey opened 7 months ago

Jefffrey commented 7 months ago

ORC spec: https://orc.apache.org/docs/types.html

The spec says: "Hive always uses a struct with a field for each of the top-level columns as the root object type, but that is not required." So an ORC file can have a root type that is not a struct.

See the discussion in https://github.com/datafusion-contrib/datafusion-orc/pull/45
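
To make the request concrete, here is a minimal sketch of one way a reader could treat a non-struct root: expose it as a single top-level column. Everything in it is hypothetical and for illustration only. `OrcType`, `top_level_columns`, and the `"root"` placeholder name are not part of datafusion-orc or the ORC spec, and the approach is an assumption, not a decided design.

```rust
/// Tiny hypothetical model of ORC type descriptions
/// (not the crate's real types).
#[derive(Debug, Clone, PartialEq)]
enum OrcType {
    Int,
    String,
    List(Box<OrcType>),
    Struct(Vec<(String, OrcType)>),
}

/// Flatten a root type into named top-level columns.
/// A struct root yields one column per field (the Hive convention);
/// any other root is exposed as a single column under a placeholder
/// name, which is an assumption made for this sketch.
fn top_level_columns(root: &OrcType, placeholder: &str) -> Vec<(String, OrcType)> {
    match root {
        OrcType::Struct(fields) => fields.clone(),
        other => vec![(placeholder.to_string(), other.clone())],
    }
}

fn main() {
    // Hive-style file: the root is a struct of columns.
    let hive_style = OrcType::Struct(vec![
        ("id".to_string(), OrcType::Int),
        ("name".to_string(), OrcType::String),
    ]);
    // Non-struct root: the whole file is a list of ints.
    let bare_list = OrcType::List(Box::new(OrcType::Int));

    println!("{:?}", top_level_columns(&hive_style, "root"));
    println!("{:?}", top_level_columns(&bare_list, "root"));
}
```

With this shape, the struct case keeps its existing behaviour and only the non-struct case needs a naming decision for the synthetic column.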

Jefffrey commented 3 months ago

Not even PyArrow supports this btw

So even lower priority