nerc-project / operations

Issues related to the operation of the NERC OpenShift environment
2 stars 0 forks source link

Should we move GPUs from OpenStack to OpenShift? #668

Closed joachimweyl closed 3 months ago

joachimweyl commented 3 months ago

Motivation

We have 16 V100s at 4%-20% usage and 16 A100SXM4s at 0% usage in OpenStack. Is OpenShift a better location for them? Also worth noting is that we have a request for 4 SXM4 BM nodes pending from RH we might not even need to move them into OpenShift we can move them to ESI and then directly to BM for a while until RH is done with them and then move them into OpenShift Prod. We already have one request to move a V100 GPU out of OpenStack.

Completion Criteria

Decide if we should move some of the GPU nodes from OpenStack to OpenShift.

Description

Original Move Suggestion

Completion dates

Desired - 2024-08-09 Required - 2024-08-21

joachimweyl commented 3 months ago

@syockel your thoughts on this would be helpful

joachimweyl commented 3 months ago

If we can't come to a decision by next Wed I will bring it up in the Operations meeting.

joachimweyl commented 3 months ago

Decision made with Scott & Michael to move 8 V100s and 8 A100SXM4s out of OpenStack. 1 V100 to go to OpenShift testing cluster, 7 V100s to go to OpenShift production. all 8 A100SXM4s to go to OpenShift Production.

joachimweyl commented 3 months ago

680 created.