triton-inference-server / local_cache

Implementation of a local in-memory cache for Triton Inference Server's TRITONCACHE API
BSD 3-Clause "New" or "Revised" License
5 stars 1 forks source link

Flatten TRITONCACHE API structure, use callbacks to avoid unnecessary copies #4

Closed rmccorm4 closed 1 year ago

rmccorm4 commented 1 year ago

Corresponding core pr: https://github.com/triton-inference-server/core/pull/167