Open scottyeager opened 2 days ago
This makes it difficult to assess whether the system is in a degraded state.
While i agree that it is something that should be improved. I don't think that assessing by listing all stored files is a good idea for these reasons:
i think exposing repair/rebuild queue would be enough
I can do without the list command, though I do think it would be handy for both human and machine consumption under different circumstances.
Exposing info on the repair queue would be fine. One thing I think is important though is that there's a way to get at the info both from CLI and via Prometheus.
One thing I think is important though is that there's a way to get at the info both from CLI and via Prometheus.
yes, fully agree with this
Currently there is no way to query the list of stored files and also no way to see the health of individual files in terms of how many shards they have stored in healthy backends. This makes it difficult to assess whether the system is in a degraded state. It's especially relevant when recovering from some backend failure to be able to check if all files have been rebuilt onto newly supplied backends. Being able to see a list of stored files is also helpful for general inspection of the system without needing to run lots of
check
commands and also keep a separate list of files that have been removed from local storage.So I'm thinking of something like this:
list
command that lists the stored fileslist
orcheck
or both)expected shards
available, if not do they at least haveminimal shards
available)