A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
Refactored by adding for loops that support multiple rounds of hash bucketing and merging. Deltacat pytests pass with these changes. Please let me know if this need more modularization (this is more a implementation match to the POC code).
Refactored by adding for loops that support multiple rounds of hash bucketing and merging. Deltacat pytests pass with these changes. Please let me know if this need more modularization (this is more a implementation match to the POC code).