databendlabs / databend

𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
https://docs.databend.com
Other
7.88k stars 751 forks source link

Feature: Support AWS Glue Catalog for Apache Iceberg #16793

Closed Rowlandev closed 5 days ago

Rowlandev commented 2 weeks ago

Databend currently supports rest and hms when creating an Apache Iceberg catalog. However, it's not supported for AWS Glue.

My proposition is to enable a query like the following, or somethings similar for users using Glue:

CREATE CATALOG analytics_testing
TYPE = ICEBERG
CONNECTION = (
    DATABASE = '<glue database name as it is in s3>'
        ROLE_ARN = '<iam role to assume>'
    REGION = '<region; go us-east-1>'
);

SHOW CREATE CATALOG analytics_testing;

Until this is implemented, users are given the following error:

iceberg catalog with type glue is not supported
sundy-li commented 2 weeks ago

Maybe we should implement it in iceberg-rs and cc @Xuanwo.

Xuanwo commented 2 weeks ago

Already supported in https://github.com/apache/iceberg-rust/blob/main/crates/catalog/glue/README.md.

We need to integrate it inside databend.

Rowlandev commented 5 days ago

This has been included in Databend with this pull request.

Thanks to @Xuanwo, @BohuTANG, @drmingdrmer & @sundy-li for the help.