canonical / charmed-aether-sd-core

Charmed Aether SD-Core is a secure, reliable and observable open source 5G private mobile network.
https://canonical-charmed-aether-sd-core.readthedocs-hosted.com/en/latest/
Apache License 2.0

docs: add troubleshooting how-to for deployment issues #40

Closed gatici closed 3 days ago

gatici commented 1 week ago

Description

This PR adds a troubleshooting how-to for Charmed Aether SD-Core deployment problems.

gatici commented 1 week ago

My general impression is that there's not much actual troubleshooting here. We're either telling the users to create a bug report or to read the manual again. Do we actually need this guide?

What do you propose this deployment troubleshooting how-to should include?

Gmerold commented 1 week ago

Personally I think we don't need it.

gruyaume commented 1 week ago

@Gmerold Is your recommendation to not have any troubleshooting guide at all?

I'd be ok with this if we think they're not useful. I don't want to add more documentation for no reason.

Gmerold commented 1 week ago

I'm not saying we shouldn't have any troubleshooting guides at all. In fact I think that having them can save us some time in the future. I just don't think we need this particular guide.

EDIT: If we want to keep it as a placeholder for describing typical deployment mistakes in the future, I'd reduce it to just the first point - the one about using an existing model name.

gatici commented 1 week ago

I would like to keep the cases below, as our tutorials do not provide enough information about them:

  • Juju model already exists
  • Terraform deploy getting i/o timeout

Gmerold commented 1 week ago

What could be the reason for getting the I/O timeout? Is it possible that after successful bootstrapping it will still fail?

gatici commented 1 week ago

Yes, it can fail. Bootstrapping is done against a cloud. If the cloud becomes unreachable for some reason after a successful bootstrap, the Juju controller becomes inaccessible and the I/O timeout appears. Some reasons that can make the cloud (MicroK8s) API unreachable:
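
Whatever the underlying cause, the symptom described here (the controller or cloud API no longer answering) could be confirmed with a plain TCP reachability check before re-running the deployment. The following is only a minimal sketch, not something taken from the PR: the helper name `is_endpoint_reachable` is hypothetical, and the host and port to probe are assumptions that depend on the environment.

```python
import socket


def is_endpoint_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Hypothetical helper for illustration; it only checks that the port accepts
    connections, not that the API behind it is healthy.
    """
    try:
        # create_connection resolves the host and attempts a TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts, DNS failures and timeouts.
        return False
```

For example, `is_endpoint_reachable("10.0.0.2", 17070)` would probe a controller at a made-up address, assuming the controller API listens on port 17070, and port 16443 could be probed the same way if the MicroK8s API server uses it. A `False` result suggests the timeout is a connectivity problem rather than a problem in the Terraform plan.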

Gmerold commented 1 week ago

Created a bunch of suggestions. Feel free to ignore them if you don't like them.

gatici commented 1 week ago

I updated the PR and replied on comments.

gatici commented 4 days ago

Last minor comments. Other than that I think it's OK.

These comments are addressed in the latest commit.