A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
Add a one-click AWS Glue Job Runner that exposes a simplified singular CLI command for creating, configuring, and running an AWS Glue for Ray job against an Iceberg catalog with an integrated local or PyPi build of DeltaCAT.
As part of this story, we also want to ensure that we can read Iceberg tables into both Daft Dataframes and Ray Datasets within our Glue Job.
Add a one-click AWS Glue Job Runner that exposes a simplified singular CLI command for creating, configuring, and running an AWS Glue for Ray job against an Iceberg catalog with an integrated local or PyPi build of DeltaCAT.
As part of this story, we also want to ensure that we can read Iceberg tables into both Daft Dataframes and Ray Datasets within our Glue Job.