triton-inference-server / local_cache

Implementation of a local in-memory cache for Triton Inference Server's TRITONCACHE API
BSD 3-Clause "New" or "Revised" License
5 stars 1 forks source link

Add initial LocalCache implementation and Metrics #5

Closed rmccorm4 closed 1 year ago