Open jwongz opened 4 years ago
PTAL @july2993.
Error caused by invalid connection to the checkpoint database (verifying the roles of the IP addresses). There should probably be a retry loop in (*MysqlCheckPoint).Save
.
PTAL @july2993.
Error caused by invalid connection to the checkpoint database (verifying the roles of the IP addresses). There should probably be a retry loop in
(*MysqlCheckPoint).Save
.
There's no try now. It just fatal if failed to save checkpoint(default saved at the downstream db). I don't think the scale-in tikv at upstream is related to this problem
since tikv has be moved for two days,this log is nerver printed
{"log":"[2020/09/03 11:11:00.859 +08:00] [INFO] [syncer.go:251] [\"write save point\"] [ts=419195167884705793]\n","stream":"stdout","time":"2020-09-03T03:11:00.85920493Z"}
and this log print for two days:
{"log":"[2020/09/03 11:04:23.736 +08:00] [INFO] [client_batch.go:257] [\"batchRecvLoop re-create streaming fail\"] [target=xxx:10000] [error=\"rpc error: code = Unavailable desc = all SubConns are in TransientFailure, latest connection error: connection error: desc = \\\"transport: Error while dialing dial tcp xxx:10000: connect: connection refused\\\"\"]\n","stream":"stdout","time":"2020-09-03T03:04:23.736332662Z"}
After drainer-server restart, it's ok now
{"log":"[2020/09/03 11:04:23.736 +08:00] [INFO] [client_batch.go:257] [\"batchRecvLoop re-create streaming fail\"] [target=xxx:10000] [error=\"rpc error: code = Unavailable desc = all SubConns are in TransientFailure, latest connection error: connection error: desc = \\"transport: Error while dialing dial tcp xxx:10000: connect: connection refused\\"\"]\n","stream":"stdout","time":"2020-09-03T03:04:23.736332662Z"}
This is probably is the bug of tikv client https://github.com/pingcap/tidb/pull/10301. and is already fixed in the latest version.
But I not sure what it will cause drainer can no longer work forward and save checkpoint.
Bug Report
Please answer these questions before submitting your issue. Thanks!
migrate tikv
normal
drainer error
drainer -V
in terminal to get drainer's version)v3.0.5