ray-project / deltacat

A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
Apache License 2.0
166 stars 23 forks source link

Allow DEFAULT_V2 when avg record size is absent #362

Closed raghumdani closed 1 month ago

raghumdani commented 1 month ago

This PR allows us to set DEFAULT_V2 option for both parquet and utsv files by handling the null for average record sizes.