apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
https://paimon.apache.org/
Apache License 2.0
2.35k stars 928 forks source link

[Umbrella] [Feature] Supported More CDC Ingestion #1870

Open MonsterChenzhuo opened 1 year ago

MonsterChenzhuo commented 1 year ago

Search before asking

Motivation

Currently, Paimon supports CDC Ingestion for MySQL, Kafka, and MongoDB. I'd like to further implement data ingestion for more databases using CDC. links:https://paimon.apache.org/docs/master/how-to/cdc-ingestion.

Solution

No response

Anything else?

No response

Are you willing to submit a PR?

SteNicholas commented 1 year ago

@MonsterChenzhuo, why does Paimon support more source? IMO, the Paimon is mained to support streaming lakehouse, not connector of different source .

MonsterChenzhuo commented 1 year ago

@SteNicholas I apologize for not being clear before. This question is about implementing write support for data lakes in a similar way to MySQL's Action. links:https://paimon.apache.org/docs/master/how-to/cdc-ingestion.

Currently, the databases I've listed are relatively stable in terms of Flink CDC synchronization support. Additionally, my company has this specific requirement. Therefore, I'd like to implement it and contribute to the community.

Alibaba-HZY commented 1 year ago

oracle cdc is also needed, because oracle to ogg-json is paid and expensive

MonsterChenzhuo commented 1 year ago

@Alibaba-HZY ok

s7monk commented 1 year ago

I can complete Postgres cdc. At present, I only need to run through the test. The local test cluster can run through. WIP

MonsterChenzhuo commented 1 year ago

@s7monk ok

fuyun2024 commented 1 year ago

We have used Flink Oracle CDC before, but there are still many issues with the lower version of Debezium. I do not recommend supporting it at this time.

hwhscut commented 11 months ago

Is support for Oracle and SQL Server under development?

medivh511 commented 4 months ago

Any plan for this? I hope we could do the oracle cdc directly from paimon