apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0
2.46k stars 969 forks

[core] Optimize IncrementalStartingScanner to randomlyExecute #4229

Closed JingsongLi closed 2 months ago

JingsongLi commented 2 months ago

Purpose

Sequential batched execution should not be used here. It only optimizes memory usage, and all we need is to return the results in order.

Random execution can improve performance.
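
To illustrate the trade-off described above, here is a minimal sketch using only `java.util.concurrent`. It is not Paimon's implementation: the class `ExecutionOrderSketch` and the methods `sequentialBatched` and `submitAll` are hypothetical names chosen for this example. The assumption is that a batched strategy bounds the number of in-flight tasks (and so memory), while submitting everything at once lets the pool run tasks in whatever order it schedules them, with results still collected in input order.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

// Hypothetical illustration; names are not taken from the Paimon code base.
public class ExecutionOrderSketch {

    // Submits one batch at a time and waits for it to finish before starting the next.
    // At most batchSize tasks are in flight, which bounds memory but can leave threads idle.
    static <U, T> List<T> sequentialBatched(
            ExecutorService pool, List<U> inputs, Function<U, T> processor, int batchSize)
            throws Exception {
        List<T> results = new ArrayList<>();
        for (int start = 0; start < inputs.size(); start += batchSize) {
            int end = Math.min(start + batchSize, inputs.size());
            List<Future<T>> batch = new ArrayList<>();
            for (U input : inputs.subList(start, end)) {
                Callable<T> task = () -> processor.apply(input);
                batch.add(pool.submit(task));
            }
            for (Future<T> future : batch) {
                results.add(future.get());
            }
        }
        return results;
    }

    // Submits every task up front, so the pool may execute them in any order.
    // Futures are read back in submission order, so results stay in input order.
    static <U, T> List<T> submitAll(
            ExecutorService pool, List<U> inputs, Function<U, T> processor) throws Exception {
        List<Future<T>> futures = new ArrayList<>();
        for (U input : inputs) {
            Callable<T> task = () -> processor.apply(input);
            futures.add(pool.submit(task));
        }
        List<T> results = new ArrayList<>();
        for (Future<T> future : futures) {
            results.add(future.get());
        }
        return results;
    }

    public static void main(String[] args) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            List<Integer> inputs = Arrays.asList(1, 2, 3, 4, 5, 6);
            List<Integer> batched = sequentialBatched(pool, inputs, x -> x * 10, 2);
            List<Integer> all = submitAll(pool, inputs, x -> x * 10);
            System.out.println(batched);
            System.out.println(all);
        } finally {
            pool.shutdown();
        }
    }
}
```

Both methods return the same ordered list. The difference is only in how many tasks the pool can work on at once, which is where the performance gain would be expected to come from.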

Tests

API and Format

Documentation

wwj6591812 commented 2 months ago

+1