kubernetes-sigs / wg-serving

WG Serving
https://github.com/kubernetes/community/tree/master/wg-serving
Apache License 2.0
13 stars 12 forks source link

[Serving Catalog] Add llama3-70b support for vllm #21

Closed jjk-g closed 3 weeks ago

jjk-g commented 1 month ago

Adds llama3-70b for vllm on 8x L4 GPUs.

ahg-g commented 1 month ago

Please add it to https://github.com/kubernetes-sigs/wg-serving/blob/main/serving-catalog/catalog.md

k8s-ci-robot commented 4 weeks ago

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jjk-g

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files: - ~~[serving-catalog/OWNERS](https://github.com/kubernetes-sigs/wg-serving/blob/main/serving-catalog/OWNERS)~~ [jjk-g] Approvers can indicate their approval by writing `/approve` in a comment Approvers can cancel approval by writing `/approve cancel` in a comment
jjk-g commented 4 weeks ago

Is this supposed to get marked approved automatically when I push now? ^^^ @ahg-g

ahg-g commented 3 weeks ago

someone still needed to lgtm

/lgtm