Open ahurtaud opened 1 month ago
I think it is a valid issue. Help wanted.
After a discussion with @MichaHoffmann, we came to realise that sidecar crashing can be useful for some users that rely on it to "detect" when something is wrong (like an uninitialised S3 bucket).
While it was suggested to add a metric to alert on the situation, such situations go could unnoticed.
I suggest to let sidecar crash by default and add an option to allow sidecar to continue to serve prometheus read path even if the objstore is not working.
Is your proposal related to a problem?
Also related to objstore project.
We had a network outage accessing our storage endpoint. (DNS failure) when sidecar restarted it then go into crashloop with :
While we consider objectstorage for long term metrics only, we would like sidecar to continue to serve prometheus read path and not crash.
Describe the solution you'd like
Could this error become a warning. And we would alert on a failing metrics or so instead of crashing.
Additional context
Thanos v0.35.0 ObjStore Azure