triton-inference-server / fil_backend

FIL backend for the Triton Inference Server
Apache License 2.0

Add batch stats reporting to Triton #397

Closed: hcho3 closed this 4 months ago

hcho3 commented 4 months ago

Use TRITONBACKEND_ModelInstanceReportBatchStatistics to report statistics about batching.

Example: https://github.com/triton-inference-server/tensorflow_backend/blob/515466c5760612aad0458d5e4f832c4114163271/src/tensorflow.cc#L2548-L2553

hcho3 commented 4 months ago

This is already implemented in RAPIDS-Triton.