redpanda-data / redpanda

Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
https://redpanda.com

datalake/configs: rework iceberg enablement configs #24222

Closed bharathv closed 1 day ago

bharathv commented 4 days ago

Additionally includes a fix for potential translation gaps due to compaction.

Normally, gaps shouldn't happen with translation because max_collectible_offset is enforced. However, the following sequence of actions can create a gap:

iceberg enabled -> iceberg disabled -> compaction -> iceberg enabled

The gap is unfixable if compaction cleaned up an offset range adjacent to the last translated offset. The fix plugs the gap by adjusting the begin offset of the range that is to be committed with the coordinator. This case is rare, but when it happens it can leave translation stuck.
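The begin-offset adjustment described above can be illustrated with a minimal Python sketch. This is not the code from this PR (Redpanda itself is C++); `OffsetRange`, `plug_gap`, and `last_committed` are hypothetical names, and the sketch assumes inclusive offset ranges and that the coordinator expects each committed range to start right after the last committed offset.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OffsetRange:
    """Inclusive offset range produced by one translation pass (hypothetical type)."""
    begin: int
    end: int


def plug_gap(last_committed: int, pending: OffsetRange) -> OffsetRange:
    """Return the range to commit so that it is contiguous with last_committed.

    Assumption: the coordinator only accepts a range that begins at
    last_committed + 1. If compaction removed the offsets in between,
    the pending range starts later and would otherwise never be accepted.
    """
    expected_begin = last_committed + 1
    if pending.begin > expected_begin:
        # The offsets [expected_begin, pending.begin) no longer exist in the
        # log, so pull the begin offset back to cover the hole.
        return OffsetRange(expected_begin, pending.end)
    # No gap: commit the range unchanged.
    return pending
```

For example, with a last committed offset of 100 and a pending range of 151 to 200, `plug_gap(100, OffsetRange(151, 200))` yields a range of 101 to 200, so the commit is contiguous and translation can make progress.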

Backports Required

Release Notes

bharathv commented 3 days ago

/dt

vbotbuildovich commented 3 days ago

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58481#01935047-8947-485f-b31f-5cf421174d22:

"rptest.tests.datalake.rest_catalog_connection_test.RestCatalogConnectionTest.test_redpanda_connection_to_rest_catalog.cloud_storage_type=CloudStorageType.S3.use_serde_parquet=True"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58481#01935047-8945-44ae-b536-dadf8713a3c5:

"rptest.tests.datalake.rest_catalog_connection_test.RestCatalogConnectionTest.test_redpanda_connection_to_rest_catalog.cloud_storage_type=CloudStorageType.S3.use_serde_parquet=False"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58481#0193504a-8a2c-4b19-b2e1-5b8e7fc58e5c:

"rptest.tests.datalake.rest_catalog_connection_test.RestCatalogConnectionTest.test_redpanda_connection_to_rest_catalog.cloud_storage_type=CloudStorageType.S3.use_serde_parquet=True"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58481#0193504a-8a38-4159-892b-1ca0aa2b864c:

"rptest.tests.datalake.rest_catalog_connection_test.RestCatalogConnectionTest.test_redpanda_connection_to_rest_catalog.cloud_storage_type=CloudStorageType.S3.use_serde_parquet=False"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58523#0193520a-4f69-4593-946f-6551b73f4474:

"rptest.tests.datalake.coordinator_retention_test.CoordinatorRetentionTest.test_retention.cloud_storage_type=CloudStorageType.S3"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58523#019351f7-0f3c-4f02-9cce-e4183c526459:

"rptest.tests.datalake.coordinator_retention_test.CoordinatorRetentionTest.test_retention.cloud_storage_type=CloudStorageType.S3"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58574#019354eb-5466-4a19-9021-bca8ae9f8e74:

"rptest.tests.datalake.coordinator_retention_test.CoordinatorRetentionTest.test_retention.cloud_storage_type=CloudStorageType.S3"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58594#019355a1-4bf1-4961-8219-5c104eb4c759:

"rptest.tests.polaris_catalog_smoke_test.PolarisCatalogSmokeTest.test_connecting_to_catalog.cloud_storage_type=CloudStorageType.S3.with_tls=True"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58594#019355a1-4bf6-41e6-9905-2aaa9caf93ce:

"rptest.tests.polaris_catalog_smoke_test.PolarisCatalogSmokeTest.test_connecting_to_catalog.cloud_storage_type=CloudStorageType.S3.with_tls=False"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58614#01935651-afe5-4198-a64e-6c0eb7a09dbf:

"rptest.tests.polaris_catalog_smoke_test.PolarisCatalogSmokeTest.test_connecting_to_catalog.cloud_storage_type=CloudStorageType.S3.with_tls=False"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58614#01935651-afde-4afe-a063-993d2eda99b3:

"rptest.tests.polaris_catalog_smoke_test.PolarisCatalogSmokeTest.test_connecting_to_catalog.cloud_storage_type=CloudStorageType.S3.with_tls=True"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58614#01935656-ff17-402f-abe2-15b67488e8d0:

"rptest.tests.polaris_catalog_smoke_test.PolarisCatalogSmokeTest.test_connecting_to_catalog.cloud_storage_type=CloudStorageType.S3.with_tls=False"

non flaky failures in https://buildkite.com/redpanda/redpanda/builds/58614#01935656-ff1a-4173-a45e-e2123a263309:

"rptest.tests.polaris_catalog_smoke_test.PolarisCatalogSmokeTest.test_connecting_to_catalog.cloud_storage_type=CloudStorageType.S3.with_tls=True"

vbotbuildovich commented 3 days ago

Retry command for Build#58481

please wait until all jobs are finished before running the slash command

/ci-repeat 1
tests/rptest/tests/datalake/rest_catalog_connection_test.py::RestCatalogConnectionTest.test_redpanda_connection_to_rest_catalog@{"cloud_storage_type":1,"use_serde_parquet":true}
tests/rptest/tests/datalake/rest_catalog_connection_test.py::RestCatalogConnectionTest.test_redpanda_connection_to_rest_catalog@{"cloud_storage_type":1,"use_serde_parquet":false}

vbotbuildovich commented 3 days ago

ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/58481#01935047-8943-4f97-8a01-8d23d8300b36
ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/58481#0193504a-8a2c-4b19-b2e1-5b8e7fc58e5c
ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/58523#019351f7-0f3d-487a-952d-abffbe431a71
ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/58523#0193520a-4f6d-4b79-a245-87922ff91240
ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/58523#0193520a-4f6b-4f8c-bab8-87df84c25535
ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/58523#019351f7-0f38-4dd9-9639-3f39bd242cde

vbotbuildovich commented 2 days ago

Retry command for Build#58523

please wait until all jobs are finished before running the slash command

/ci-repeat 1
tests/rptest/tests/datalake/coordinator_retention_test.py::CoordinatorRetentionTest.test_retention@{"cloud_storage_type":1}
tests/rptest/tests/datalake/datalake_e2e_test.py::DatalakeE2ETests.test_e2e_basic@{"cloud_storage_type":1,"filesystem_catalog_mode":true,"query_engine":"spark"}
tests/rptest/tests/datalake/datalake_e2e_test.py::DatalakeE2ETests.test_e2e_basic@{"cloud_storage_type":1,"filesystem_catalog_mode":true,"query_engine":"trino"}

bharathv commented 2 days ago

A new fix landed in dev that conflicts with this PR, rebasing shortly.

vbotbuildovich commented 2 days ago

Retry command for Build#58574

please wait until all jobs are finished before running the slash command

/ci-repeat 1
tests/rptest/tests/datalake/coordinator_retention_test.py::CoordinatorRetentionTest.test_retention@{"cloud_storage_type":1}

vbotbuildovich commented 2 days ago

Retry command for Build#58594

please wait until all jobs are finished before running the slash command

/ci-repeat 1
tests/rptest/tests/polaris_catalog_smoke_test.py::PolarisCatalogSmokeTest.test_connecting_to_catalog@{"cloud_storage_type":1,"with_tls":true}
tests/rptest/tests/polaris_catalog_smoke_test.py::PolarisCatalogSmokeTest.test_connecting_to_catalog@{"cloud_storage_type":1,"with_tls":false}

bharathv commented 2 days ago

Retry command for Build#58594

please wait until all jobs are finished before running the slash command

/ci-repeat 1
tests/rptest/tests/polaris_catalog_smoke_test.py::PolarisCatalogSmokeTest.test_connecting_to_catalog@{"cloud_storage_type":1,"with_tls":true}

Another new test added during rebase that was referring to a removed config 💀

vbotbuildovich commented 2 days ago

Retry command for Build#58614

please wait until all jobs are finished before running the slash command

/ci-repeat 1
tests/rptest/tests/polaris_catalog_smoke_test.py::PolarisCatalogSmokeTest.test_connecting_to_catalog@{"cloud_storage_type":1,"with_tls":false}
tests/rptest/tests/polaris_catalog_smoke_test.py::PolarisCatalogSmokeTest.test_connecting_to_catalog@{"cloud_storage_type":1,"with_tls":true}

vbotbuildovich commented 1 day ago

/backport v24.3.x

vbotbuildovich commented 1 day ago

Failed to create a backport PR to v24.3.x branch. I tried:

git remote add upstream https://github.com/redpanda-data/redpanda.git
git fetch --all
git checkout -b backport-pr-24222-v24.3.x-765 remotes/upstream/v24.3.x
git cherry-pick -x aa17e5e6c6 5cbd42abf8 882af4db57 e0fabee560 54c42df9d1 8eca184766 8f0fc0494f 23829f562d 1ba9882e8d a81ee528ef a87e6ce9c7 2d923519b6

Workflow run logs.