Open NissesSenap opened 12 months ago
So I found a workaround for this.
I think the problem is that even though the config isn't used, value extraction happens in the mimir config.
I added
extraEnvFrom:
- secretRef:
name: mimir-etcd-auth
To both the querier and the ruler, which solved the issue. After this, I don't get the error again. I have no idea why this error doesn't happen on all mimir deployments.
I will keep this open and hopefully someone that knows the code base better than me can find the issue/document it.
Thanks for reporting this!
The querier uses some of the distributor code to find ingesters. As a result it ends up initializing the distributor module (DistributorService in the code). Since queriers (and all components in the Helm chart) use the same config, the distributor module in the querier pod ends up starting the HA tracker as well. I can't think of an easy way to fix this in code.
This should be mostly harmless. The querier will only have extra bandwidth following the HA Tracker keys in etcd, which are relatively low-volume anyways.
If you don't want to mount the secret on rulers, queriers, and query-frontends, then you can set this flag on each one of them individually in helm:
querier:
extraArgs:
distributor.ha-tracker.enable: false
ruler:
extraArgs:
...
Hi, fyi I ran into this with distributor too when using the helm chart, my mistake was assuming the env variable will be created with the ETCD_ROOT_PASSWORD
name with this config;
extraEnvFrom:
- name: ETCD_ROOT_PASSWORD
secretRef:
name: mimir-etcd
key: password
Turns out it wasn't, it used the key (for the name of the env variable) from the secret. But the error message was misleading because it was complaining about the username being empty but it's actually set in the config. When the password wasn't "set" because of expecting it under different env variable.
If I understand this issue correctly, it will be resolved by https://github.com/grafana/mimir/pull/9976
Describe the bug
Running the querier together with a distributor that uses etcd gives the following error over and over again. I can see the same error in the ruler
To Reproduce
Steps to reproduce the behavior:
Expected behavior
To my knowledge, the querier doesn't use etcd. So it shouldn't be there.
Environment
Additional Context
I use the mimir-distributed helm chart version 5.1.0
This is how a simplified version of my mimir values file look like.