cloudfuse-io / buzz-rust

Serverless query engine
MIT License
140 stars 11 forks source link

Add statistics to Parquet execution plan #14

Open rdettai opened 3 years ago

rdettai commented 3 years ago

The parquet execution plan used in Buzz (https://github.com/cloudfuse-io/buzz-rust/blob/master/code/src/execution_plan/parquet.rs) is a forked version of the one in DataFusion (https://github.com/apache/arrow/blob/master/rust/datafusion/src/physical_plan/parquet.rs).

The fork is outdated as DataFusion recently added the capability to read statistics from the parquet footer. The Buzz fork should thus be aligned to also support this functionality.