triton-inference-server / server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.
https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/index.html
BSD 3-Clause "New" or "Revised" License
8.38k stars 1.49k forks

fix: Copy models out of NFS before starting Triton to avoid intermittent startup timeouts #7730

Closed by rmccorm4 1 month ago

rmccorm4 commented 1 month ago

See GitLab PR !1316 for details beyond the title.
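
For readers without access to the GitLab PR, the idea in the title can be sketched roughly as follows. This is not the change made in this PR, only an illustration of it: the helper names `stage_model_repository` and `triton_command` and the temporary staging directory are hypothetical. The only Triton-specific detail relied on is the `--model-repository` flag of `tritonserver`.

```python
import shutil
import tempfile
from pathlib import Path
from typing import List, Optional


def stage_model_repository(nfs_repo: str, local_root: Optional[str] = None) -> Path:
    """Copy a model repository from a network mount to local disk.

    Hypothetical helper: the staging location is an assumption, not
    something taken from the PR.
    """
    src = Path(nfs_repo)
    if not src.is_dir():
        raise FileNotFoundError(f"model repository not found: {src}")
    # Use a fresh temporary directory when no staging root is given.
    root = Path(local_root) if local_root else Path(tempfile.mkdtemp(prefix="triton_models_"))
    dst = root / src.name
    # Copy the whole tree so the server only reads from local disk at startup.
    shutil.copytree(src, dst, dirs_exist_ok=True)
    return dst


def triton_command(local_repo: Path) -> List[str]:
    """Build the server command line pointing at the staged copy."""
    return ["tritonserver", f"--model-repository={local_repo}"]
```

A caller would stage the repository first and then launch the server with the returned command, for example through `subprocess.Popen(triton_command(dst))`, so that model loading no longer depends on NFS read latency.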