apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0
2.12k stars, 839 forks

[Bug] performance issue in CanalRecordParser #3571

Closed. MOBIN-F closed this issue 1 week ago

MOBIN-F commented 1 week ago

Search before asking

Paimon version

master

Compute Engine

flink

Minimal reproduce step

none

What doesn't meet your expectations?

The current CanalRecordParser#extractRowData parses the same strings more than once, which may cause performance problems when the data volume is large and records have many fields.
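The screenshots attached to this report are not available here, so the actual code in CanalRecordParser is not shown. As a generic illustration of the kind of cost described, the following minimal Java sketch contrasts re-parsing a payload string for every field with parsing it once and reusing the result. Everything in it is hypothetical and for illustration only: the class `ParseOnceSketch`, the toy `key=value` payload format, and the methods `parsePayload`, `extractPerField`, and `extractOnce` are not Paimon APIs.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, not Paimon code: shows why parsing the same
// string once per field scales poorly with the number of fields.
public class ParseOnceSketch {

    // Counts how many times the (stand-in) parser has been invoked.
    static int parseCalls = 0;

    // Toy stand-in for an expensive parse step.
    // Turns "id=1,name=a" into {id=1, name=a}.
    static Map<String, String> parsePayload(String payload) {
        parseCalls++;
        Map<String, String> parsed = new LinkedHashMap<>();
        for (String pair : payload.split(",")) {
            String[] kv = pair.split("=", 2);
            parsed.put(kv[0], kv.length > 1 ? kv[1] : null);
        }
        return parsed;
    }

    // Duplicate parsing: the whole payload is parsed again for each field,
    // so the work grows with (number of fields) x (payload size).
    static Map<String, String> extractPerField(String payload, List<String> fields) {
        Map<String, String> row = new LinkedHashMap<>();
        for (String field : fields) {
            row.put(field, parsePayload(payload).get(field));
        }
        return row;
    }

    // Parse once, then look each field up in the already-parsed result.
    static Map<String, String> extractOnce(String payload, List<String> fields) {
        Map<String, String> parsed = parsePayload(payload);
        Map<String, String> row = new LinkedHashMap<>();
        for (String field : fields) {
            row.put(field, parsed.get(field));
        }
        return row;
    }

    public static void main(String[] args) {
        String payload = "id=1,name=a,ts=9";
        List<String> fields = Arrays.asList("id", "name", "ts");

        parseCalls = 0;
        Map<String, String> slow = extractPerField(payload, fields);
        System.out.println("per-field parse calls: " + parseCalls);

        parseCalls = 0;
        Map<String, String> fast = extractOnce(payload, fields);
        System.out.println("parse-once calls: " + parseCalls);

        System.out.println("same result: " + slow.equals(fast));
    }
}
```

Both methods return the same row; only the number of parse invocations differs, which is why the gap widens as records gain more fields.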

Anything else?

No response

Are you willing to submit a PR?