CCI-MOC / esi

Elastic Secure Infrastructure project
6 stars 12 forks source link

Add `resource_class` field to nodes #509

Closed tzumainn closed 5 months ago

tzumainn commented 7 months ago

We need to

tzumainn commented 6 months ago

@hakasapl @naved001 Do we have a sense of how we might want to categorize our current nodes? Let me know, and I can update the node manifests and the current nodes.

hakasapl commented 6 months ago

@tzumainn I think we had talked about naming by significant node differences, so we would use:

Not sure what we should call the various different hardware in OCT4 though. @naved001 thoughts?

naved001 commented 6 months ago

@hakasapl aren't the OCT4 servers some sort of Dell PowerEdge servers? So, if it's a PowerEdge R720xd we should call it that R720xd.

joachimweyl commented 6 months ago

@tzumainn what are the next steps for this issue?

tzumainn commented 6 months ago

I need a determination of how nodes should be categorized; I think resource_class will ultimately be used for pricing, so we may be blocked on https://github.com/CCI-MOC/ops-issues/issues/1271

joachimweyl commented 6 months ago

@larsks you had some great ideas of how to categorize the nodes, can you chime in?

joachimweyl commented 6 months ago

@msdisme what are your thoughts about bringing this to the Monday stratagy meeting to get feedback on what categories we would want to be billing on?

larsks commented 6 months ago

@joachimweyl I'm not sure that I did have a great idea here. Most of my interest in this was about the ability to filter nodes based on attributes, which is sort of orthogonal to this categorization question.

hakasapl commented 6 months ago

In my opinion it should be the shortened model of the machine:

fc830, fc430, r720xd, etc. Then for specifically different machines, like the ones with NVME drives, fc830-nvme etc.

Basically we only need a different resource class if the cost of running them is significantly different, which translates to a difference in performance. So I think model number works fine, then add extensions if we need to further differentiate for whatever reason

larsks commented 6 months ago

In my opinion it should be the shortened model of the machine:

What if we have two machines from two different vendors (dell and lenovo?) with roughly equivalent hardware -- should those really be two different resource classes? Or should resource_class more closely correlate with something like an openstack flavor -- that is, an alias for a certain amount of resources.

I don't know the answer myself.

tzumainn commented 6 months ago

In my opinion it should be the shortened model of the machine:

What if we have two machines from two different vendors (dell and lenovo?) with roughly equivalent hardware -- should those really be two different resource classes? Or should resource_class more closely correlate with something like an openstack flavor -- that is, an alias for a certain amount of resources.

I don't know the answer myself.

We're going to end up having both resource_class and node properties visible, so in my opinion a single resource class should correspond to a "useful" high-level categorization, like cost (with perhaps a special dump resource_class category for random pieces of hardware where cost depends on node properties). But yeah, this feels like a policy question.

hakasapl commented 6 months ago

@tzumainn I'm happy to make a PR for this unless you already have something working

tzumainn commented 6 months ago

@hakasapl oh, I don't - I was actually waiting for some sort of confirmation, but if you're confident on this you'll definitely be better than me at making sure nodes are labeled correctly!

hakasapl commented 6 months ago

PR submitted: https://github.com/CCI-MOC/esi-pilot/pull/57

tzumainn commented 5 months ago

PR merged, and nodes updated - thanks!