Closed blackmidnight closed 3 months ago
This issue has been marked as stale due to 180 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the issue at any time. Thank you for your contributions.
This issue has been closed due to lack of activity. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.
Related Template(s)
cdc-embedded-connector
What happened?
We use cdc-embedded-connector to transfer a MySQL table to GCP Pub/Sub. At the beginning, the table had only a few hundred thousand rows, and the program ran the complete process from snapshot to binlog normally. After the table grew to 10 million rows, the snapshot stage was interrupted after scanning about 4.85 million rows. I retried many times. The number of rows scanned before the interruption was different each time (720,000, 1.76 million, etc.), and so was the time until the interruption (15 minutes, 30 minutes, etc.).

The log contains no task exception information. We checked the source code and debugged, but could not find the cause. Whether the cause is a timeout on the MySQL server or an OOM on the client, there should be a corresponding error log, so we are puzzled.

Our environment is:
mysql: azure mysql 5.7
k8s: 1.21.14-gke.3000
offset storage: file

The relevant log screenshot is as follows:
Beam Version
Newer than 2.35.0
Relevant log output