Closed by mita 1 year ago
Hi, thanks for reporting this. And thanks for posting the config!
We spent the last few weeks chasing this race condition down. It should now be fixed by f4e30a7f3c878bc119577a11d2ca69c72d41a925 and 08bf0b4e8510eb1646ccad057e6209b937b6fe89, which have been merged to our develop branch.
Let us know if you are still hitting this deadlock. Also, if you don't mind sharing, are you experimenting with tiered memory? If so, what does your environment look like?
Thank you. The problem doesn't happen anymore. We are planning to evaluate it with a tiered memory system, but we have not prepared an evaluation machine yet.
When I run cachebench with my test config, stress workers get stuck forever.
test.json.txt
Backtrace of the stress worker
FWIW, I can't reproduce this problem without commit 68e66394 ("Create token before marking item as exclusive").