Open FenilJain2301 opened 6 months ago
@FenilJain2301 dont see why the extra test/
is added to the path
looking at the examples in iceberg documentation following two settings should not be necessary. could you remove them and try again? especially the first one.
debezium.sink.iceberg.s3.endpoint=https://test.s3.amazonaws.com
debezium.sink.iceberg.s3.path-style-access=true
And what happens when you provide the warehouse name like below? does it returns data when you manually fix the table location?
debezium.sink.iceberg.warehouse=s3://test/test/warehouse
This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.
Hi @ismailsimsek, We are successfully able to create iceberg tables data and successfully push to the s3 bucket, But we are facing the issue that when the data is pushed to aws glue in the form of tables it is not accessible because it is taking only object URI and not object path with bucket URI. Suppose I give the endpoint of the bucket or bucket name "test" and debezium.sink.iceberg.warehouse=s3://test/warehouse, it should create a data in test bucket within the warehouse object. the test will be the bucket name and warehouse will be the bucket object and that path for the aws glue table "s3://test/warehouse" but what happened is it is creating a test object within that object it is creating warehouse means in bucket path, the path of data will be "test/test/warehouse" and glue data table path should be "s3://test/test/warehouse".
Problem is that the bucket path is "test/test/warehouse" and the glue path is for tables it's "s3://test/warehouse". So when we try to access object from the aws glue table, it is showing error: object is not found. We will share the application.properties file parameter below:
application.properties:
From github document, we think that it should take the bucket name and just create the object , but it creating the object under the object and AWS glue only considers s3a:// path only
Bucket data path. test/test/warehouse AWS glue Table path >> s3a://test/warehouse so its not able to find objects in bucket.
bucket URI: s3://dremio-virginia/test/default.db/debeziumcdc_tutorial_public_test AWS glue table uri for accessing bucket data :[s3://test/default.db/debeziumcdc_tutorial_public_test]
we attached the screenshots for your better understanding. Please revert us waiting for your reply.
Thanks, Mr. Fenil Jain