Closed matthewmturner closed 1 month ago
Updated the description a littlre
take
Cross posting here as it relates to duckdb functions https://github.com/apache/datafusion/issues/12254
hi @devanbenz just checking in on this - do you think youll have time to work on this? if not i can pick it up.
i have started work on this
Idea is to implement these metadata capabilities from duckdb
Originally posted by @alamb in https://github.com/datafusion-contrib/datafusion-dft/issues/125#issuecomment-2353468345
Here is
parquet_metadata
implementation in datafusion-cli:Implementation is here: https://github.com/apache/datafusion/blob/257e1409eca81cfff024ecc5e2567e9f67e6b5a3/datafusion-cli/src/functions.rs#L317-L459
I would like to suggest creating those functions in their own crate (perhaps datafusion-functions-parquet?) -- it could be in the datafusion-dft repo initially for convenience, but I think eventually the goal should be that dft just be focused on integration rather than actually implementing such features.
@matthewmturner says: