apache / datafusion-comet

Apache DataFusion Comet Spark Accelerator
https://datafusion.apache.org/comet
Apache License 2.0

Add support for Iceberg #1028

Open andygrove opened 1 month ago

andygrove commented 1 month ago

What is the problem the feature request solves?

This issue is to track the status of Iceberg support in Comet.

There is a PR open against Iceberg to make use of Comet's native Parquet decoding:

https://github.com/apache/iceberg/pull/9841

Once this is merged and a new Iceberg release is available, Comet will be updated to support reading from Iceberg.

Describe the potential solution

No response

Additional context

It may be possible to read from Iceberg today by enabling spark.comet.sparkToColumnar.enabled and adding the Iceberg scan operator to spark.comet.sparkToColumnar.supportedOperatorList, but this has not been tested and may not perform well.
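
For anyone who wants to try the workaround, here is a rough sketch of how the two settings could be assembled. The two config keys come from the comment above. Everything else is an assumption for illustration: the helper names are hypothetical, "BatchScan" is only a guess at the operator name an Iceberg read shows up as in the Spark plan (check your own plan output), and "Range,InMemoryTableScan" is a placeholder for whatever list you already have configured. Like the workaround itself, this has not been tested against Iceberg.

```python
# Sketch only: assembles the two Comet settings mentioned in this issue.
# Helper names and the operator name passed in below are hypothetical.

def comet_iceberg_confs(scan_operator, existing_operators=""):
    """Return Spark conf entries that add `scan_operator` to the
    Spark-to-columnar operator list without dropping existing entries."""
    # Split the current comma-separated list, ignoring blanks and stray spaces.
    operators = [op.strip() for op in existing_operators.split(",") if op.strip()]
    # Append the scan operator only if it is not already listed.
    if scan_operator not in operators:
        operators.append(scan_operator)
    return {
        "spark.comet.sparkToColumnar.enabled": "true",
        "spark.comet.sparkToColumnar.supportedOperatorList": ",".join(operators),
    }


def to_spark_submit_args(confs):
    """Render conf entries as a flat list of `--conf key=value` arguments."""
    args = []
    for key, value in confs.items():
        args += ["--conf", f"{key}={value}"]
    return args


# "BatchScan" is an assumed operator name, not confirmed for Iceberg.
confs = comet_iceberg_confs("BatchScan", existing_operators="Range,InMemoryTableScan")
print(" ".join(to_spark_submit_args(confs)))
```

The printed arguments can be appended to an existing spark-submit or spark-shell command that already loads Comet and the Iceberg runtime. The same key/value pairs could instead be passed through a SparkSession builder's config calls.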

viirya commented 1 month ago

Related issue: #974