etcd-io / etcd

Distributed reliable key-value store for the most critical data of a distributed system
https://etcd.io
Apache License 2.0
47.52k stars 9.74k forks source link

Separate Robustness compaction and LazyFS test scenario #18144

Open serathius opened 3 months ago

serathius commented 3 months ago

Which Github Action / Prow Jobs are flaking?

https://testgrid.k8s.io/sig-etcd-robustness#ci-etcd-robustness-amd64

Which tests are flaking?

TestRobustnessExploratory/Kubernetes/LowTraffic/ClusterOfSize1/LazyFS TestRobustnessExploratory/Etcd/LowTraffic/ClusterOfSize1/LazyFS

Github Action / Prow Job link

https://prow.k8s.io/view/gs/kubernetes-jenkins/logs/ci-etcd-robustness-amd64/1799048278302527488

Reason for failure (if possible)

Since introduction of compaction in https://github.com/etcd-io/etcd/pull/17833 we started getting traffic.go:129: Requiring minimal 100.000000 qps before failpoint injection for test results to be reliable, got 89.482132 qps

I would like to propose to create a separate Compaction and LazyFS as they might be too heavy to CI. For example separate TestRobustnessExploratory/Etcd/LowTraffic/ClusterOfSize1 into:

TODO:

Anything else we need to know?

No response

serathius commented 3 months ago

cc @MadhavJivrajani @siyuanfoundation @henrybear327 @fuweid