Seagate / cortx-k8s

CORTX Kubernetes Orchestration Repository
https://github.com/Seagate/cortx
Apache License 2.0
6 stars 47 forks source link

Add troubleshooting for server pod failure #356

Closed hessio closed 7 months ago

hessio commented 2 years ago

Please see detailed reason for creating this PR in #355

Signed-off-by: Patrick Hession patrick.hession@seagate.com

Description

Breaking change

Type of change

Applicable issues

CORTX image version requirements

This change requires the following images:

How was this tested?

Additional information

Checklist

If this change requires newer CORTX or third party image versions:

If this change addresses a CORTX Jira issue:


View rendered README.md

hessio commented 2 years ago

I think this needs a bit more work. A troubleshooting step should identify a specific problem and resolution. "Server pod fails during deployment" is too general of a problem, they could fail for a variety of reasons. Restarting the coredns pods won't address every situation like this, so this step should be clear about which specific failure case it applies to.

Ya good point, I have update the heading to: "CORTX Deployment failed due to "timeout waiting for cortx-server or cortx-data-g0 to initialize" " - do you think this is explanatory enough or no? cc: @keithpine

shailesh-vaidya commented 7 months ago

Closing as an obsolete