defenseunicorns / leapfrogai

Production-ready Generative AI for local, cloud native, airgap, and edge deployments.
https://leapfrog.ai
Apache License 2.0
245 stars 25 forks source link

feat(backends): implement observability hooks for backends #297

Open gerred opened 3 months ago

gerred commented 3 months ago

User Story: Implement Backend Prometheus Metrics

As a backends operator I want to have Prometheus metrics for observability of the vLLM backend So that I can monitor the performance, health, and usage of the vLLM backend

Acceptance Criteria:

justinthelaw commented 5 days ago

Bump this issue with the following additions, on top of vLLM. Each one must have all of the criteria as listed for vLLM in the original issue description: