Open shamanthchandra-yb opened 1 month ago
Per @ttyusupov "Compactions tasks priorities are frequently changed because they are based on number of SST files in current RocksDB state. And that is causing pausing and transferring control to tasks with higher priority again and again. That was increasing the effect of going out of stable state, because that makes node to work slowly on almost all 150-250 background compactions switching between them instead of completing them one by one."
The following tserver gflags changes helped as it avoided frequent priority changes to compactions tasks and therefore pausing/resuming of compactions.
compaction_priority_step_size=10
compaction_priority_start_bound=20
Jira Link: DB-12659
Description
A potential bug has been observed in the YugabyteDB cluster where SST files are not being compacted as expected, and node n2 is experiencing frequent paused compactions along with high CPU usage. Please find slack thread in JIRA description.
Setup Details:
Configuration:
Observations:
Issue Type
kind/bug
Warning: Please confirm that this issue does not contain any sensitive information