ray-project / deltacat

A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
Apache License 2.0
166 stars 23 forks source link

Sort by dw_last_updated before materializing #334

Open DmitriGekhtman opened 4 months ago

DmitriGekhtman commented 4 months ago

When handling compaction of an upsert delta, sort the table descending by dw_last_update_time prior to materializing the result.