Open knizhnik opened 1 year ago
I am not sure this is a real problem, because this test uses a small (1 MB) layer size and so produces a large number of small layers. But I am afraid the same situation can happen with normal-sized layers and a huge database.
@knizhnik could you please supply more information: how many layers, the size of the DB, and whether it is a debug or release build?
It happens with both debug and release builds; the number of layers at the moment of the crash is ~7000. Actually, the crash can be easily reproduced with the attached test.
I don't see any changes in layer_coverage.rs that look like a fix for this, but the test doesn't fail -- the test code in this issue description is exactly like what is in test_gc_cutoff in main today (56171cbe8c2b81ba2b949a5ec39c11991fb5e47a), which doesn't hit a stack overflow.
I'm going to leave this ticket open: until we have a test that explicitly creates huge numbers of tiny layers and then checks compaction/GC still work properly, the issue might still be here.
Steps to reproduce
test_gc_dropped_relations.py:
Expected result
Increase pageserver storage size
Actual result
Crash at second iteration with stack overflow:
Environment
Logs, links