apple / foundationdb

FoundationDB - the open source, distributed, transactional key-value store
https://apple.github.io/foundationdb/
Apache License 2.0

[Release-7.1] Cherry-pick Improve distributed consistency checker #11351

Closed kakaiu closed 2 months ago

kakaiu commented 2 months ago

Cherry-pick of https://github.com/apple/foundationdb/pull/11346

100K correctness test run with 1 failure unrelated to this PR: 20240502-004327-zhewang-5177db71281d373a compressed=True data_size=26926268 duration=5064278 ended=100000 fail=1 fail_fast=10 max_runs=100000 pass=99999 priority=100 remaining=0 runtime=1:08:02 sanity=False started=100000 stopped=20240502-015129 submitted=20240502-004327 timeout=5400 username=zhewang
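For readers who want to check the numbers, the summary line above can be read as whitespace-separated key=value tokens. The following is only a sketch under that assumption: the helper name `parse_summary` is hypothetical and is not part of any FoundationDB tooling, and the run ID is left out of the string.

```python
# Summary fields copied from the comment above (run ID omitted).
summary = (
    "compressed=True data_size=26926268 duration=5064278 ended=100000 "
    "fail=1 fail_fast=10 max_runs=100000 pass=99999 priority=100 "
    "remaining=0 runtime=1:08:02 sanity=False started=100000 "
    "stopped=20240502-015129 submitted=20240502-004327 timeout=5400 "
    "username=zhewang"
)


def parse_summary(line):
    """Hypothetical helper: split key=value tokens into a dict of strings."""
    fields = {}
    for token in line.split():
        key, sep, value = token.partition("=")
        if sep:  # skip any token that has no '=' in it
            fields[key] = value
    return fields


fields = parse_summary(summary)
passed = int(fields["pass"])
failed = int(fields["fail"])
ended = int(fields["ended"])
pass_rate = passed / ended

print(f"{passed}/{ended} runs passed, {failed} failed ({pass_rate:.3%})")
```

With the values reported here, the passed and failed counts add up to the number of ended runs, so no run is unaccounted for.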

foundationdb-ci commented 2 months ago

Result of foundationdb-pr-clang on Linux CentOS 7

foundationdb-ci commented 2 months ago

Result of foundationdb-pr on Linux CentOS 7

foundationdb-ci commented 2 months ago

Result of foundationdb-pr-cluster-tests on Linux CentOS 7

foundationdb-ci commented 2 months ago

Result of foundationdb-pr-macos on macOS Ventura 13.x

foundationdb-ci commented 2 months ago

Result of foundationdb-pr-macos-m1 on macOS Ventura 13.x

kakaiu commented 2 months ago

In the CI failure, a storage server got stuck at a version. As a result, ConsistencyCheckUrgent always failed to read the replica from this storage server, and eventually too many trace events were written. Since this is unrelated to this PR, we re-kicked the CI test.

Reproducible by:
~/src/fdbserver-6-2-15 -r simulation -f ../src/foundationdb/tests/restarting/from_5.0.0_until_6.3.0/CycleTestRestart-1.txt --buggify on --seed 438152461 --logsize 1024MiB
../build_output/bin/fdbserver -r simulation -f ../src/foundationdb/tests/restarting/from_5.0.0_until_6.3.0/CycleTestRestart-2.txt --buggify on --seed 438152462 --logsize 1024MiB --restarting

foundationdb-ci commented 2 months ago

Result of foundationdb-pr-macos-m1 on macOS Ventura 13.x

foundationdb-ci commented 2 months ago

Result of foundationdb-pr-macos on macOS Ventura 13.x

foundationdb-ci commented 2 months ago

Result of foundationdb-pr-clang on Linux CentOS 7

foundationdb-ci commented 2 months ago

Result of foundationdb-pr on Linux CentOS 7

foundationdb-ci commented 2 months ago

Result of foundationdb-pr-cluster-tests on Linux CentOS 7