Currently HoodieMergedLogRecordScanner could use HoodiePreCombineAvroRecordMerger to merge log records, which treats all the records from the same batch, and does not respect the order of records across different batches. Such that commit time merging logic is broken.
Therefore, we use the configured merger instead.
Impact
Fix the bug for commit time merging.
Risk level (write none, low medium or high below)
Medium.
Documentation Update
Describe any necessary documentation update if there is any new feature, config, or user-facing change. If not, put "none".
The config description must be updated if new configs are added or the default value of the configs are changed
Any new feature or user-facing change requires updating the Hudi website. Please create a Jira ticket, attach the
ticket number here and follow the instruction to make
changes to the website.
Change Logs
Currently
HoodieMergedLogRecordScanner
could useHoodiePreCombineAvroRecordMerger
to merge log records, which treats all the records from the same batch, and does not respect the order of records across different batches. Such that commit time merging logic is broken.Therefore, we use the configured merger instead.
Impact
Fix the bug for commit time merging.
Risk level (write none, low medium or high below)
Medium.
Documentation Update
Describe any necessary documentation update if there is any new feature, config, or user-facing change. If not, put "none".
Contributor's checklist