redhat-et / foundation-models-for-documentation

Improve ROSA customer experience (and customer retention) by leveraging foundation models to do “gpt-chat” style search of Red Hat customer documentation assets.
Other
24 stars 11 forks source link

[EPIC] Resource requirements and cost of foundation models #36

Open Shreyanand opened 1 year ago

Shreyanand commented 1 year ago

The large size of foundation models raise several resource and cost questions around deploying them in production. This EPIC will focus on creating experiments and showing results around some of the following questions:

codificat commented 1 year ago

About what happens when lowering precision, here's an interesting blog post: LLM.int8() and Emergent Features

suppathak commented 1 year ago

Adding here links useful for this issue:

Shreyanand commented 1 year ago

@suppathak These experiments (1, 2) are directly related to 1 GPU task that you're doing. You should adapt these for our context.