upgundecha / howtheysre

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Creative Commons Zero v1.0 Universal
9.13k stars 780 forks source link

Added a blog post from Chaos Mesh #110

Closed danidanz closed 2 years ago

danidanz commented 2 years ago

I would like to add a blog post about chaos engineering from Chaos Mesh.

How to Develop a Daily Reporting System to Track Chaos Testing Results

Summary

Chaos Mesh enables you to simulate faults that most cloud-native applications might encounter. In this article, I created a PodChaos experiment and observed that QPS in the TiDB cluster was affected when the Pod became unavailable. After analyzing the logs, I can enhance the robustness and high availability of the system. I built a web application to generate daily reports for troubleshooting and debugging. You can also customize the reports to meet your own requirements.

upgundecha commented 2 years ago

Hi @danidanz thanks for the PR. However, this is not a relevant post for this repo. Please consider adding it to https://github.com/dastergon/awesome-chaos-engineering