datafusion-contrib / datafusion-orc

Implementation of Apache ORC file format use Apache Arrow in-memory format
Apache License 2.0
30 stars 8 forks source link

Write ORC from arrow recordbatches #11

Open Jefffrey opened 8 months ago

Jefffrey commented 8 months ago

Not a focus now, just raising issue here for tracking

Currently in progress.

Initial support

Tracked by initial-write-support branch

Checklist:

Once complete will raise PR for all the above, to provide a complete and usable writer (though lacking in features see below).

Subsequent features

Following items will be added in smaller PRs once base code of writer is merged to main.

Jefffrey commented 1 month ago

Beginning work on this.

I'll be committing to the initial-write-support branch.

Will want to get a minimum end to end version before merging into main (so resultant PR might be big), supporting basic types like string/integer/float etc.