Open joachimweyl opened 1 year ago
Is there a way to track rate of fill? Eg. how quickly are allocations being used up to project what we may need? (query only-not a request to implement this part)
Bringing back to strategy/roadmapping meeting for priority
Eg. extension of images store. @jtriley @waygil - in terms of eliminating service disruption. May be covered by observability.
@schwesig @computate this is already in the plan for Observability to cover. Can you investigate and post a followup here.
ACK
@joachimweyl with whom can I have a call about this to clarify. The 80GB are maybe an example, but close to some GPU memory sizes. So I want to be sure what memory, how you define it. I assume we are only talking about prod cluster and coldfront claimed projects?
@schwesig you can set up a meeting with @jtriley to gather information about storage totals. To gather information about usage I assume observability can gather that in some way. The 80GB piece discussed in the notes section is an example. essentially what it is trying to say is that Coldfront might say that someone is allocated 80GB or heck 8TB but that is not what their systems are currently using. For example one of those 80GB or 8TB drives might be actually using only 100MB of the space and that leaves 79.9GB or 7.9999TB the idea behind this issue is to find a way to calculate the requested value (coldfront) and then to also find out the amount actually in use (observability).
Motivation
We should be able to track storage and trigger a warning at say 10% storage remaining. Technically we can oversubscribe as volumes are thin subscribed and many projects will never use the full space allocated.
Completion Criteria
Calculate storage actually used vs allocated. Alert at 90% used.
Description
Notes
rbd du -p <POOLNAME>
should return the value we wantCompletion dates
Desired - early 2024 Required - mid 2024