tikv / pd

Placement driver for TiKV

FATAL log when cluster start #7228

Open Lily2025 opened 11 months ago

Lily2025 commented 11 months ago

Bug Report

What did you do?

Started a new cluster.

What did you expect to see?

No FATAL log.

What did you see instead?

A FATAL log is printed when the cluster starts:

[2023/10/18 21:04:34.288 +08:00] [FATAL] [main.go:250] ["join meet error"] [error="etcdserver: unhealthy cluster"] [stack="main.start\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/pd/cmd/pd-server/main.go:250\nmain.createServerWrapper\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/pd/cmd/pd-server/main.go:190\ngithub.com/spf13/cobra.(*Command).execute\n\t/go/pkg/mod/github.com/spf13/cobra@v1.0.0/command.go:846\ngithub.com/spf13/cobra.(*Command).ExecuteC\n\t/go/pkg/mod/github.com/spf13/cobra@v1.0.0/command.go:950\ngithub.com/spf13/cobra.(*Command).Execute\n\t/go/pkg/mod/github.com/spf13/cobra@v1.0.0/command.go:887\nmain.main\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/pd/cmd/pd-server/main.go:70\nruntime.main\n\t/usr/local/go/src/runtime/proc.go:267"]
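For triage, entries like the one above can be picked out of a larger log file mechanically. The following is a minimal sketch, not part of PD: it assumes every line follows the bracketed layout seen in this report (`[time] [LEVEL] [file:line] [message] [key=value]...`), and the names `logEntry` and `parseLogLine` are hypothetical helpers introduced only for illustration.

```go
package main

import (
	"fmt"
	"regexp"
)

// logEntry holds the parts of one bracketed log line (hypothetical type).
type logEntry struct {
	Time    string
	Level   string
	Source  string
	Message string
	Fields  map[string]string
}

// headerRe matches the first four bracketed sections:
// time, level, source, and a message that is either quoted or bare.
var headerRe = regexp.MustCompile(
	`^\[([^\]]+)\] \[([A-Z]+)\] \[([^\]]+)\] \[(?:"((?:[^"\\]|\\.)*)"|([^\]]*))\]`)

// fieldRe matches trailing [key=value] or [key="value"] sections.
var fieldRe = regexp.MustCompile(
	`\[([A-Za-z0-9_.-]+)=(?:"((?:[^"\\]|\\.)*)"|([^\]]*))\]`)

// parseLogLine splits one line into its header and key/value fields.
// Escape sequences inside quoted values (such as \n in a stack trace)
// are kept as written, not unescaped.
func parseLogLine(line string) (logEntry, bool) {
	m := headerRe.FindStringSubmatch(line)
	if m == nil {
		return logEntry{}, false
	}
	msg := m[4]
	if msg == "" {
		msg = m[5] // unquoted message, e.g. [PD]
	}
	e := logEntry{
		Time:    m[1],
		Level:   m[2],
		Source:  m[3],
		Message: msg,
		Fields:  map[string]string{},
	}
	// Scan only the remainder after the header for key/value sections.
	for _, f := range fieldRe.FindAllStringSubmatch(line[len(m[0]):], -1) {
		val := f[2]
		if val == "" {
			val = f[3]
		}
		e.Fields[f[1]] = val
	}
	return e, true
}

func main() {
	line := `[2023/10/18 21:04:34.288 +08:00] [FATAL] [main.go:250] ["join meet error"] [error="etcdserver: unhealthy cluster"]`
	if e, ok := parseLogLine(line); ok && e.Level == "FATAL" {
		fmt.Printf("%s %s %s: %s\n", e.Level, e.Source, e.Message, e.Fields["error"])
	}
}
```

Filtering on `Level == "FATAL"` and grouping by the `error` field is one way to check whether the same join error recurs across restarts.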

What version of PD are you using (pd-server -V)?

[2023/10/18 21:04:39.381 +08:00] [INFO] [versioninfo.go:89] ["Welcome to Placement Driver (PD)"]
[2023/10/18 21:04:39.381 +08:00] [INFO] [versioninfo.go:90] [PD] [release-version=v7.5.0-alpha]
[2023/10/18 21:04:39.381 +08:00] [INFO] [versioninfo.go:91] [PD] [edition=Community]
[2023/10/18 21:04:39.381 +08:00] [INFO] [versioninfo.go:92] [PD] [git-hash=cb9c70c6edc48f07ad617d18afce1bb080925781]
[2023/10/18 21:04:39.381 +08:00] [INFO] [versioninfo.go:93] [PD] [git-branch=heads/refs/tags/v7.5.0-alpha]
[2023/10/18 21:04:39.381 +08:00] [INFO] [versioninfo.go:94] [PD] [utc-build-time="2023-10-18 11:36:48"]
[2023/10/18 21:04:39.381 +08:00] [INFO] [metricutil.go:86] ["disable Prometheus push client"]

Lily2025 commented 11 months ago

/severity Moderate
/assign lhy1024

rleungx commented 4 months ago

Anything we need to do about this issue? /cc @lhy1024

lhy1024 commented 4 months ago

I tried to reproduce it with TiUP but had no success. The issue seems more likely to occur in Operator deployments.

The issue produces a FATAL log, but it does not affect the deployment of the cluster as a whole.