issues
search
hjacobs
/
kubernetes-failure-stories
Compilation of public failure/horror stories related to Kubernetes
https://k8s.af
6.23k
stars
309
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Added EKS networking war story
#54
yashmehrotra
closed
4 years ago
1
GitHub Availability Report: July 2020 (ImagePullPolicy Always)
#53
hjacobs
closed
4 years ago
1
adding conntrack failure story
#52
dmitri-lerko
closed
4 years ago
0
Add article about dns issues at Preply
#51
ghost
closed
4 years ago
2
Potential story: VPA recommender evicting large number of pods
#50
hjacobs
closed
4 years ago
0
Kubernetes Networking Problems Due to the Conntrack
#49
dmitri-lerko
closed
4 years ago
0
Adding `CPU limits and aggressive ...` blog.
#48
fayizk1
closed
4 years ago
1
Adding 'When GKE ran out of IP addresses' blog
#47
dmitri-lerko
closed
4 years ago
1
When GKE ran out of IP addresses
#46
hjacobs
closed
4 years ago
1
Monzo: fun failure stories promised
#45
hjacobs
closed
4 years ago
2
Sailing with the Istio through the shallow water
#44
hjacobs
closed
4 years ago
1
Story about failed integration of Istio
#43
kubaj
closed
5 years ago
3
Feature request - RSS feed
#42
sourcedelica
closed
5 years ago
2
Adevinta latency issue
#41
srvaroa
closed
5 years ago
1
Add dex/CR/bad defaults failure story
#40
pieterlange
closed
5 years ago
7
Prezi conntrack issues on EKS
#39
hjacobs
closed
5 years ago
0
fix markdownlint
#38
hjacobs
closed
5 years ago
0
Adds FREE NOW postmortem of kops container-selinux issue
#37
hrzbrg
closed
5 years ago
1
FreeNow: New K8s workers unable to join cluster
#36
hjacobs
closed
4 years ago
1
Add our ValidatingWebhookConfiguration story
#35
charlieegan3
closed
5 years ago
3
Introduce MarkdownLint
#34
femueller
closed
5 years ago
1
Update README.md
#33
ereli-cb
closed
5 years ago
0
Add Grafana Labs Post Mortem
#32
tomwilkie
closed
5 years ago
1
Grafana Production Outage Caused Using Kubernetes Pod Priorities
#31
hjacobs
closed
5 years ago
0
Node pool upgrade incident
#30
hjacobs
closed
4 years ago
1
#28 XING ContainerDays video
#29
hjacobs
closed
5 years ago
0
Moving to Kubernetes: the Bad and the Ugly - Maxime Lagresle
#28
hjacobs
closed
5 years ago
0
Stripe Learning to operate Kubernetes Reliably
#27
mmmries
closed
4 years ago
1
Create structure for people to contribute failure stories as Markdown
#26
hjacobs
closed
4 years ago
1
Bump on the (network) roads with Kubernetes
#25
Joacchim
closed
4 years ago
1
Algolia, killing a cluster with jobs
#24
ElPicador
closed
5 years ago
1
gravitational postgres experience
#23
jtolio
closed
5 years ago
2
Adds disclaimer to not use this list an excuse to stay away from Kubernetes
#22
kamranahmedse
closed
5 years ago
3
10 ways to shoot yourself in the foot with kubernetes, #9 will surprise you! (Container Day Paris)
#21
hjacobs
closed
4 years ago
2
KubeCon Datadog video
#20
hjacobs
closed
5 years ago
0
KubeCon Lightning Talk by Pusher
#19
hjacobs
closed
5 years ago
1
KubeCon Europe: Kubernetes Failure Stories
#18
hjacobs
closed
5 years ago
1
Kubernetes the very hard way (by Datadog) contains some lessons
#17
bgrant0607
closed
5 years ago
3
Add link to Skyscanner HTTP ingress postmortem
#16
gjtempleton
closed
5 years ago
1
Adding Loveholidays experience with GKE cluster upgrade
#15
dmitri-lerko
closed
5 years ago
1
Add Civis Analytics' outage blog post
#14
salilgupta1
closed
5 years ago
1
Add Moonlight's outage post-mortem
#13
philipithomas
closed
5 years ago
4
Adding "How NOT to do Kubernetes talk"
#12
medyagh
closed
5 years ago
3
Add Stories From Playbook talk
#11
povilasv
closed
5 years ago
5
Group by k8s release, etcd release, etc.
#10
max-lobur
closed
4 years ago
1
Add involved topics & impact
#9
hjacobs
closed
5 years ago
0
Adds Zalando talk from October 2017
#8
Raffo
closed
5 years ago
5
Small improvements to README.md
#7
johnlunney
closed
5 years ago
0
Failure story length and language
#6
marratj
closed
5 years ago
9
Update README.md
#5
kjgorman
closed
5 years ago
1
Next