apache / hudi

Upserts, Deletes And Incremental Processing on Big Data.
https://hudi.apache.org/
Apache License 2.0
5.43k stars 2.42k forks source link

[SUPPORT] Error upsetting bucketType UPDATE for partition :20240119 #10639

Open zjq888 opened 9 months ago

zjq888 commented 9 months ago

Hudi Flink Config: 'connector' = 'hudi', 'path' = 's3://bnb-datalake-hudi/**', 'table.type' = 'COPY_ON_WRITE', 'write.batch.size' = '512', 'write.tasks' = '4', 'write.bucket_assign.tasks' = '4', 'write.operation' = 'upsert', 'write.task.max.size' = '4096', 'write.merge.max_memory' = '3072', 'write.precombine' = 'true', 'precombine.field' = 'update_time', 'hive_sync.enable' = 'true', 'hive_sync.db' = '---', 'hive_sync.table' = '---', 'hive_sync.mode' = 'GLUE', 'hive_sync.partition_fields' = 'date_key', 'write.rate.limit' = '15000', 'hive_sync.partition_extractor_class' = 'org.apache.hudi.hive.MultiPartKeysValueExtractor'

It started to run normally, but after a period of execution, the following error will be reported(every sync will report more than 10 partitions) Exception trace during upsert:

@timestamp 2024-02-06T20:38:41.222Z,

ad1happy2go commented 8 months ago

@danny0405 Any insights here?