Closed jusjosj closed 4 days ago
I am willing to submit a fix for this issue and have sometime next week to do so.
The way I plan to tackle this is to add stats fields for JsonWriter struct then hydrate them from the DeltaTable object in try_new or for_table. If the stats settings are not present, I will default the fields to the existing defaults.
Environment
Delta-rs version: 0.21.0
Binding: Rust
Environment:
Bug
What happened: JsonWriter and RecordBatchWriter do not respect
delta.dataSkippingStatsColumns
What you expected to happen:
delta.dataSkippingStatsColumns
should be able to limit stats collectionHow to reproduce it: Run the following test case for json.rs and observe
delta.dataSkippingStatsColumns
set properly, but the stats record containing values for all columns instead of just idMore details:
Issue is caused by this block of code in the flush function where it is hardcoding the stats_collection information when creating the add actions instead of pulling this information from the table.