2i2c-org / infrastructure

Infrastructure for configuring and deploying our community JupyterHubs.
https://infrastructure.2i2c.org
BSD 3-Clause "New" or "Revised" License
106 stars 65 forks source link

catalyst-af: create hub-specific nodegroups #5117

Closed sgibson91 closed 3 days ago

sgibson91 commented 6 days ago
github-actions[bot] commented 6 days ago

Merging this PR will trigger the following deployment actions.

Support and Staging deployments

Cloud Provider Cluster Name Upgrade Support? Reason for Support Redeploy Upgrade Staging? Reason for Staging Redeploy
aws catalystproject-africa No Yes Following helm chart values files were modified: staging.values.yaml

Production deployments

Cloud Provider Cluster Name Hub Name Reason for Redeploy
aws catalystproject-africa nm-aist Following helm chart values files were modified: nm-aist.values.yaml
aws catalystproject-africa must Following helm chart values files were modified: must.values.yaml
aws catalystproject-africa uvri Following helm chart values files were modified: uvri.values.yaml
aws catalystproject-africa wits Following helm chart values files were modified: wits.values.yaml
aws catalystproject-africa kush Following helm chart values files were modified: kush.values.yaml
aws catalystproject-africa molerhealth Following helm chart values files were modified: molerhealth.values.yaml
aws catalystproject-africa aibst Following helm chart values files were modified: aibst.values.yaml
aws catalystproject-africa bhki Following helm chart values files were modified: bhki.values.yaml
aws catalystproject-africa bon Following helm chart values files were modified: bon.values.yaml
sgibson91 commented 6 days ago

Getting the same error here as on JMTE #5101

sgibson91 commented 3 days ago
2024-11-18 14:05:26 [✖]  found 1 nodegroup(s) (nb-bon-r5-16xlarge) without shared security group, cluster networking maybe be broken
2024-11-18 14:05:26 [✖]  it's recommended to create new nodegroups, then delete old ones
2024-11-18 14:05:26 [✖]  as a temporary fix, you can patch the configuration and add each of these nodegroup(s) to "sg-093ed905c45100bc1"
sgibson91 commented 3 days ago
Resource handler returned message: "The maximum number of rules per security group has been reached. (Service: Ec2, Status Code: 400, Request ID: 0e9ce1e3-bf07-4b15-a7cb-a6d80b617098)" (RequestToken: b9dff271-941d-ffdd-7bb2-44ec6d286986, HandlerErrorCode: ServiceLimitExceeded)
sgibson91 commented 3 days ago

I tried to resolve this by increasing our "Security groups per network interface" quota, but that did not mitigate the problem.

sgibson91 commented 3 days ago

Resolved by increasing the "Inbound or outbound rules per security group" quota

github-actions[bot] commented 3 days ago

:tada::tada::tada::tada:

Monitor the deployment of the hubs here :point_right: https://github.com/2i2c-org/infrastructure/actions/runs/11899615341