kiwix / operations

Kiwix Kubernetes Cluster
http://charts.k8s.kiwix.org/

Week 38 2024 routine #255

Closed kiwixbot closed 2 months ago

kiwixbot commented 2 months ago

Check nodes free space

df -h / && df -h /data
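
The free-space check above could also be scripted so the figures are easy to paste into the routine. This is only a minimal sketch using Python's standard `shutil.disk_usage`. The helper names `percent_used` and `usage_line` are hypothetical, and the percentage approximates the `Use%` column of `df` (used divided by used plus available, rounded up) rather than reproducing it exactly.

```python
import math
import shutil

GIB = 1024 ** 3


def percent_used(used: float, free: float) -> int:
    """Approximate df's Use%: used / (used + available), rounded up."""
    if used + free == 0:
        return 0
    return math.ceil(100 * used / (used + free))


def usage_line(path: str) -> str:
    """Return a one-line summary for the filesystem holding `path`."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    return (
        f"{path}: size={usage.total / GIB:.1f}G "
        f"used={usage.used / GIB:.1f}G "
        f"avail={usage.free / GIB:.1f}G "
        f"use={percent_used(usage.used, usage.free)}%"
    )


if __name__ == "__main__":
    # Same mount points as the df command; /data does not exist on every node.
    for mount in ("/", "/data"):
        try:
            print(usage_line(mount))
        except FileNotFoundError:
            print(f"{mount}: not present on this machine")
```
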

Nodes system upgrades

apt update && apt upgrade

(Regular updates of worker nodes are done separately, on a monthly basis, so as not to impact production.)

Backups

k8s cluster

Stats

matomo - stats.kiwix.org

Grafana

Projects

Security

Note: this is an automatic reminder intended for the assignee(s).

rgaudin commented 2 months ago

Storage

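| Machine | Filesystem | Size | Used | Avail | Use% | Use change |
|---|---|---|---|---|---|---|
| bastion | / | 37G | 15G | 21G | 42% | - |
| stats | / | 233G | 110G | 111G | 50% | +1G |
| services | / | 456G | 315G | 118G | 73% | +4G |
| storage | / | 147G | 23G | 117G | 16% | +5G |
| storage | /data | 30T | 18T | 11T | 63% | +1T |
| imager-worker | / | 1.9T | 568G | 1.2T | 32% | don't care |
| sisyphus | / | 233G | 23G | 198G | 11% | don't care |
| ondemand | / | 25G | 9.7G | 14G | 42% | - |
| ondemand | /data | 216G | 204M | 205G | 1% | don't care |
| mirrors-qa | / | 38G | 3.8G | 32G | 11% | -1G |
| demo | / | 40G | 9.5G | 29G | 23% | -1G |
| demo | /data | 1.8T | 925G | 739G | 56% | don't care |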
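
Rows in the shape of the storage report above can be screened automatically for filesystems that are getting full. The sketch below is illustrative only: the 70% threshold and the `flag_rows` helper are assumptions, not a documented Kiwix policy, and the sample rows are copied from the report in whitespace-separated form. Rows marked "don't care" are skipped.

```python
# A few rows from the storage report, whitespace-separated:
# machine, filesystem, size, used, avail, use%, use change
ROWS = """\
bastion / 37G 15G 21G 42% -
services / 456G 315G 118G 73% +4G
storage /data 30T 18T 11T 63% +1T
imager-worker / 1.9T 568G 1.2T 32% don't care
"""


def flag_rows(text: str, threshold: int = 70) -> list[tuple[str, str, int]]:
    """Return (machine, filesystem, use%) for rows at or above `threshold`."""
    flagged = []
    for line in text.splitlines():
        # maxsplit=6 keeps a multi-word last column ("don't care") in one piece.
        parts = line.split(maxsplit=6)
        if len(parts) < 7:
            continue  # not a data row
        machine, filesystem, _size, _used, _avail, use, change = parts
        if change == "don't care":
            continue  # explicitly unmonitored in the report
        percent = int(use.rstrip("%"))
        if percent >= threshold:
            flagged.append((machine, filesystem, percent))
    return flagged


if __name__ == "__main__":
    for machine, filesystem, percent in flag_rows(ROWS):
        print(f"{machine} {filesystem}: {percent}% used")
```
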

misc

On Thursday 2024-09-12 at 10:25 a couple of pods restarted:

cert-manager is known to restart from time to time. The fact that two other pods restarted at the same moment is interesting.


Job generation durations between 2024-09-13 and 2024-09-14 were significantly longer

[Screenshot: 2024-09-16 at 11:15:38]

Cloud Signing: 1,035 left


UptimeRobot incorrectly displays the Kiwix Wiki as down (for a year!), but the old dashboard is OK. I reported the issue to them (via their chat).

zimit

I'm getting a bit concerned about Seed Page Load Failed. A lot of them seem to work fine in my browser. It seems some WAFs are triggering this now.

benoit74 commented 2 months ago

> @benoit74 why didn't you cancel those during last week's routine?

I simply forgot to do it. Probably got interrupted by something else...

> I'm getting a bit concerned about Seed Page Load Failed. A lot of them seem to work fine in my browser. It seems some WAFs are triggering this now.

This is not really something new, is it?

rgaudin commented 2 months ago

> This is not really something new, is it?

It's zimit2, basically since we removed the Python front-test, I believe. The problems I see are:

benoit74 commented 2 months ago

> we have no ticket nor documentation about this.

It is now documented at https://github.com/openzim/zimit/wiki/Frequently-Asked-Questions#the-zim-is-not-created-and-logs-says-seed-page-load-error