CCI-MOC / ops-issues

Purchase Nvidia BlueField 2 DPUs #212

Closed larsks closed 2 years ago

larsks commented 3 years ago

Based on discussions with Nvidia and Red Hat, we are planning to purchase 20 MBF2H332A-AEEOT BlueField 2 DPUs. These are HHHL cards (half height, half length) with dual 25Gb ports.

larsks commented 3 years ago

After not hearing back from Nvidia since our initial conversation on March 1, I reached out to multiple folks on Friday (3/19) and Monday (3/22). I heard back from Brandon Hathaway:

Hi Lars – Checking on this and will circle back. I think it got held up.

larsks commented 3 years ago

Today we finally received the quote from CDW. I have the quote and I have forwarded a copy to @msdisme .

msdisme commented 3 years ago

I spoke with the salesperson - he is not allowed to sell to the University. I will get the name of BU's salesperson and see if they can honor the quote, else it may need to be sent to the new salesperson by Nvidia.

msdisme commented 3 years ago

Got the correct salesperson and forwarded the contact info to Lars. Please ask that they make it for 20 DPUs.

Thanks!

msdisme commented 3 years ago

Have a current quote and expect to place an order by March 31st, 2021.

msdisme commented 3 years ago

Order placed.

larsks commented 3 years ago

I've tried following up on this a couple of times with Nvidia and CDW, but no luck so far. I've just sent the following message to everyone involved:

I think that the DPU purchase has become lost in bureaucratic limbo between Nvidia, CDW, and BU. The complicating factor is that the quote was generated for Red Hat but BU is trying to make the purchase, which threw everything off. I've been unable to get any response from the folks at CDW or Nvidia, so maybe our best bet is to have Michael cancel the purchase and start from scratch (or re-evaluate the purchase to see if folks are still interested in this equipment, since it's been so long since we originally tried to purchase things).

msdisme commented 3 years ago

Moving out of sprints 27 and 28 until the cards are purchased/scheduled. @msdisme to create a new card for specifying machines for GPU and DPU.

msdisme commented 2 years ago

Any update on the DPU order? We have the machines in the smaug cluster, we have the 100Gb connections....

larsks commented 2 years ago

I am completely out of the loop on what's going on with this. I think we should just close it -- if someone at Red Hat wants DPUs hosted in the MOC, they'll provide them; otherwise it's not something we need to track.

msdisme commented 2 years ago

Closing until it comes up again.