apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0

[Feature] Support the json format which is used by the data integration of Aliyun Dataworks #4529

Closed: JackeyLee007 closed this issue 9 hours ago

JackeyLee007 commented 1 week ago

Motivation

We use Aliyun DataWorks and its Data Integration service to collect data from MySQL, Oracle, etc. in real time, and then write the data into Paimon with the paimon-flink-action jar. However, the jar does not support the JSON format that Aliyun Data Integration emits.

Solution

Add a new CDC JSON data parser for Aliyun Data Integration.
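
To make the idea concrete, here is a rough sketch of what such a parser has to do: read one JSON change message and turn it into changelog rows tagged with the usual row kinds (`+I`, `-U`, `+U`, `-D`). The envelope fields used here (`op`, `before`, `after`) and the function name `parse_change_record` are hypothetical placeholders, not the actual Aliyun Data Integration layout, and the sketch is in Python only for brevity; a real parser would be written in Java inside the Paimon CDC module.

```python
import json
from typing import Any, Dict, List, Tuple

# Short row-kind strings as used in Flink/Paimon changelogs.
INSERT, UPDATE_BEFORE, UPDATE_AFTER, DELETE = "+I", "-U", "+U", "-D"

Row = Tuple[str, Dict[str, Any]]


def parse_change_record(raw: str) -> List[Row]:
    """Convert one JSON change message into (row_kind, fields) pairs.

    ASSUMPTION: the "op" / "before" / "after" envelope is a hypothetical
    layout chosen for illustration; the real format may differ.
    """
    msg = json.loads(raw)
    op = msg["op"]
    before = msg.get("before") or {}
    after = msg.get("after") or {}

    if op == "INSERT":
        return [(INSERT, after)]
    if op == "UPDATE":
        # An update becomes a retraction of the old row plus the new row.
        return [(UPDATE_BEFORE, before), (UPDATE_AFTER, after)]
    if op == "DELETE":
        return [(DELETE, before)]
    raise ValueError(f"unsupported operation type: {op!r}")
```

With this placeholder layout, an `UPDATE` message carrying both images yields two rows (`-U` with the old values, `+U` with the new ones), while `INSERT` and `DELETE` each yield one.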

Anything else?

No response

Are you willing to submit a PR?

JingsongLi commented 1 day ago

Feel free to open a PR, @JackeyLee007.