Open maitredede opened 1 day ago
Hello, I have made some improvements in long installation durations :
By adding kafka provisionning parallelism, is it way faster (had to increase limits also, was OomKilled) :
kafka:
provisioning:
# replicationFactor: 3
parallel: 6
resources:
requests:
cpu: "100m"
memory: "1Gi"
limits:
cpu: "2"
memory: "4Gi"
I switched zookeeper-clickhouse
to a faster storageclass : snuba-migrate
was waaay faster (less than 2min from almost 1h).
There is still the job db-init
, because it operates on postgresql.
Since I would like to achieve high availability, I can easily scale kafka, zookeeper, clickhouse instances, but what about other components ? To have HA, I have at least to keep volumes on a replicated (slow) storageclass (ceph RBD), for postgresql, and contributing to performances bottleneck...
Issue submitter TODO list
Describe the bug (actual behavior)
When installing for the first time the sentry chart, the job
snuba-migrate
is taking a long time...Expected behavior
No response
values.yaml
Extract of my
values.yaml
:Helm chart version
From
develop
branch (commit 1782966c) :Steps to reproduce
Screenshots
No response
Logs
An small extract of logs, timestamps shows "create table" took 13s, and on first install there are lots of requests that takes this kind of delay
Additional context
Storage disks are slow :
ceph rbd
. I have moved some components to faster direct lvm disks (usingtopolvm
) : for sts clickhouse, but I don't know which volumes I should migrate (zookeeper-clickhouse ?) or which component I should increase replica count (at least for high availability).