triton-inference-server / dali_backend

The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.
https://docs.nvidia.com/deeplearning/dali/user-guide/docs/index.html
MIT License
118 stars 28 forks source link

"Training to inference" tutorial #211

Closed szalpal closed 10 months ago

szalpal commented 10 months ago

This PR introduces a training -> inference tutorial. The use-case is based on Efficientnet model, presented both in DeepLearningExamples and DALI repository. All assumptions and explanations are provided inside.