The Triton Inference Server provides an optimized cloud and edge inferencing solution.
BSD 3-Clause "New" or "Revised" License
8.39k
stars
1.49k
forks
source link
Build: Upgrading vLLM version for 24.08 release #7539
Closed
pvijayakrish closed 3 months ago
What does the PR do?
Checklist
<commit_type>: <Title>
Commit Type:
Check the conventional commit type box here and add the label to the github PR.
Background
Upgrading the vLLM version to reference the current latest v0.5.4 for 24.08 release.