Closed 474420502 closed 5 years ago
send tikv request error: rpc error: code = Unavailable desc = all SubConns are in TransientFailure, latest connection error: connection error: desc = "transport: Error while dialing dial tcp 192.168.6.101:20160: connect: connection refused", ctx: &{{787169 71 26} id:787169 end_key:"mDB:153\000\000\375\000\000\000\000\000\000\000hTable:10\377040\000\000\000\000\000\372" region_epoch:
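The above is from tidb.log. This time TiKV starts, but now TiDB fails to start.
I tried both a rolling upgrade and a non-rolling upgrade. Both get stuck at the TiDB node, which will not start.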
@474420502 From the following log, I can tell TiKV is not running:
2018/12/28 10:29:45.905 region_cache.go:482: [info] drop regions that on the store 1(192.168.6.101:20160) due to send request fail, err: rpc error: code = Unavailable desc = all SubConns are in TransientFailure, latest connection error: connection error: desc = "transport: Error while dialing dial tcp 192.168.6.101:20160: connect: connection refused"
Is there any error log in TiKV?
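A quick way to confirm the "connection refused" symptom independently of TiDB is to try a plain TCP connection to the store address. The following is a minimal sketch in Python; the helper name `is_port_open` is made up for illustration and is not part of any TiDB tooling.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers ConnectionRefusedError, timeouts, and unreachable hosts.
        return False
```

For the store in the log above, the call would be `is_port_open("192.168.6.101", 20160)`. A `False` result only says that nothing accepted the connection; the reason still has to come from the TiKV log.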
2018/12/28 09:54:31.047 INFO util.rs:406: connecting to PD endpoint: "http://192.168.6.104:2379"
2018/12/28 09:54:31.051 INFO util.rs:465: connected to PD leader "http://192.168.6.104:2379"
2018/12/28 09:54:31.051 WARN util.rs:186: heartbeat sender and receiver are stale, refreshing..
2018/12/28 09:54:31.051 INFO util.rs:190: cancel region heartbeat sender
2018/12/28 09:54:31.051 WARN util.rs:205: updating PD client done, spent 12.273793ms
2018/12/28 09:54:31.059 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.062 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.067 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.070 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.079 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.083 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.091 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.093 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.097 ERRO util.rs:336: fail to request: Grpc(RpcFailure(RpcStatus { status: Unknown, details: Some("version should compatible with version 3.0.0-alpha, got 2.1.2") }))
2018/12/28 09:54:31.097 ERRO tikv-server.rs:235: failed to start node: Pd(Other(StringError("[src/pd/util.rs:344]: fail to request")))
2018/12/28 09:58:16.315 ERRO store.rs:1164: Send compaction finished event to raftstore failed: Closed
2018/12/28 09:58:16.390 ERRO store.rs:1164: Send compaction finished event to raftstore failed: Closed
2018/12/28 09:58:17.593 ERRO store.rs:1164: Send compaction finished event to raftstore failed: Closed
@shenli I think this is the most useful part.
"version should compatible with version 3.0.0-alpha, got 2.1.2" is the root cause. It means the cluster was downgraded, not upgraded: the cluster version recorded in PD is 3.0.0-alpha, while the TiKV being started is 2.1.2. There is a simple way to fix it.
Release Version: v2.1.0-rc.3-381-g7108881
Git Commit Hash: 71088815e7c5121e069d7ec10595d176a44b9bea
Git Branch: master
UTC Build Time: 2018-12-27 09:17:13
GoVersion: go version go1.11.2 linux/amd64
Race Enabled: false
TiKV Min Version: 2.1.0-alpha.1-ff3dd160846b7d1aed9079c389fc188f7f5ea13e
Check Table Before Drop: false

Release Version: 3.0.0-alpha
Git Commit Hash: 72bf02f8a71fef7d66c69deb1f9e0b25ba49a9ee
Git Commit Branch: master
UTC Build Time: 2018-12-27 03:07:19
Rust Version: rustc 1.29.0-nightly (4f3c7a472 2018-07-17)

Release Version: v3.0.0-alpha-74-ga573364
Git Commit Hash: a5733642e029d09a50651f2881c35737dcfc7512
Git Branch: master
UTC Build Time: 2018-12-27 02:22:05

After updating to a 2.1.0 rc version once, I kept following the latest updates. Then I saw that a 2.1 release was available and wanted to update to v2.1.2.
@474420502 You can use pd-ctl to downgrade the cluster version:
>> config set cluster-version <your_version>
BTW, the usage of pd-ctl can be found in this documentation.
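When deciding what to put in `<your_version>`, it can help to read both versions straight out of the rejection message in the TiKV log. The sketch below is written in Python and is only an illustration: the function names are hypothetical, the regular expression is based solely on the message text quoted in this thread, and the comparison looks only at major.minor.patch, ignoring pre-release suffixes such as `-alpha`.

```python
import re
from typing import Optional, Tuple

# Matches the message text seen in the TiKV log in this thread.
_MISMATCH = re.compile(
    r"version should compatible with version (?P<cluster>[\w.+-]+), got (?P<store>[\w.+-]+)"
)


def parse_mismatch(line: str) -> Optional[Tuple[str, str]]:
    """Return (cluster_version, store_version) if the line holds the mismatch message."""
    m = _MISMATCH.search(line)
    return (m.group("cluster"), m.group("store")) if m else None


def _core(version: str) -> Tuple[int, ...]:
    """Extract (major, minor, patch); a leading 'v' and any pre-release suffix are ignored."""
    m = re.match(r"v?(\d+)\.(\d+)\.(\d+)", version)
    if m is None:
        raise ValueError(f"unrecognized version: {version!r}")
    return tuple(int(part) for part in m.groups())


def is_downgrade(cluster: str, store: str) -> bool:
    """True when the store's version is older than the cluster version."""
    return _core(store) < _core(cluster)
```

Feeding one of the `fail to request` lines above to `parse_mismatch` yields the pair of versions, and `is_downgrade` then tells whether the store being started is older than the version the cluster already records.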
Then can I skip the upgrade and just keep the current version? Once there is a release newer than this one, I will update to that stable version. @rleungx
Bug Report
Please answer these questions before submitting your issue. Thanks!
Dec 28 10:23:48 vnfs-ag-01 systemd[1]: Started tidb-4000 service.
root@vnfs-ag-01:/data/deploy/log# systemctl status tidb-4000.service
● tidb-4000.service - tidb-4000 service
   Loaded: loaded (/etc/systemd/system/tidb-4000.service; disabled; vendor preset: enabled)
   Active: active (running) since Fri 2018-12-28 10:23:48 CST; 18s ago
 Main PID: 12556 (tidb-server)
    Tasks: 44 (limit: 4915)
   CGroup: /system.slice/tidb-4000.service
           └─12556 bin/tidb-server -P 4000 --status=10080 --advertise-address=192.168.6.101 --

Dec 28 10:23:48 vnfs-ag-01 systemd[1]: Started tidb-4000 service.
root@vnfs-ag-01:/data/deploy/log# systemctl status tidb-4000.service
● tidb-4000.service - tidb-4000 service
   Loaded: loaded (/etc/systemd/system/tidb-4000.service; disabled; vendor preset: enabled)
   Active: active (running) since Fri 2018-12-28 10:23:48 CST; 19s ago
 Main PID: 12556 (tidb-server)
    Tasks: 44 (limit: 4915)
   CGroup: /system.slice/tidb-4000.service
           └─12556 bin/tidb-server -P 4000 --status=10080 --advertise-address=192.168.6.101 --

Dec 28 10:23:48 vnfs-ag-01 systemd[1]: Started tidb-4000 service.
root@vnfs-ag-01:/data/deploy/log# systemctl status tidb-4000.service
● tidb-4000.service - tidb-4000 service
   Loaded: loaded (/etc/systemd/system/tidb-4000.service; disabled; vendor preset: enabled)
   Active: activating (auto-restart) (Result: exit-code) since Fri 2018-12-28 10:24:08 CST; 63
  Process: 12556 ExecStart=/data/deploy/scripts/run_tidb.sh (code=exited, status=1/FAILURE)
 Main PID: 12556 (code=exited, status=1/FAILURE)
This time I used tidb-ansible at "Merge branch 'release-2.1' of https://github.com/pingcap/tidb-ansible into my", and now it is TiDB that fails to start.