neuralmagic / guidellm

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
Apache License 2.0
159 stars 11 forks source link

👷‍♂️ llama.cpp web server is added to GitHub Actions workflows #26

Open parfeniukink opened 2 months ago

parfeniukink commented 2 months ago

Requires 🔗 this PR to be merged first!

Summary

llama.cpp web server runs within GitHub external infrastructure to be able to run integration tests in the CI process