biosimulations / status

📈 Uptime monitor and status page for BioSimulations, powered by @upptime
https://status.biosimulations.org
MIT License
0 stars 1 forks source link

🛑 Combine API is down #17

Closed biosimulations-daemon closed 3 years ago

biosimulations-daemon commented 3 years ago

In 238ab71, Combine API (https://combine.api.biosimulations.org/kisao/get-similar-algorithms?algorithms=KISAO_0000019) was down:

bilalshaikh42 commented 3 years ago

Caused by memory pressure on nodes. Adding an additional node and clearing out older deployments

biosimulations-daemon commented 3 years ago

Resolved: Combine API is back up in 1db9245.

bilalshaikh42 commented 3 years ago

Due to memory pressure on the nodes, system workloads were not able to run, leading to pods being unschedulable and failing to terminate. The deployment tools were also unavailable leading to difficulty with managing the pods. Removing the node pool and recreating nodes

bilalshaikh42 commented 3 years ago

Node pool has been restarted. Argo deployment controller is loaded, and should being to load the applications again

bilalshaikh42 commented 3 years ago

Due to an error in configuration of the Argo controller, the deployment was unable to proceed automatically. The error has been fixed, and deployment should be rolling out now

bilalshaikh42 commented 3 years ago

All services should be back up