Open silvamerica opened 1 year ago
Hi @silvamerica,
Thanks for pointing this out!
The thought behind not tolerating the error was that it would be a non-recoverable issue with the http client or something. However, as it turns out, that's not the case - as you have highlighted.
Open PR at #474.
@pintohutch to clarify, will we lose any metrics being collected on that node if we encounter this crash? Or is this just isolated to the config-reloader sidecar?
Hey @mathe-matician - you will not lose metrics. This is just the config-reloader restarting because prometheus isn't ready to load config yet (presumably due to startup delay).
I've been investigating why the
config-reloader
pod occasionally restarts multiple times while booting, and I've narrowed it down to this line. I'm seeing the following in the logs:Per the comment on line 73, it seems like the intent would be to continue polling in this situation, but instead the process exits and has to be restarted.