marvik-ai / triton-llama2-adapter

MIT License
18 stars 3 forks source link

Triton License TritonServer

Deploying Llama2 with NVIDIA Triton Server tutorial

In this repository, we give an example on how to efficiently package and deploy Llama2, using NVIDIA Triton Inference Server to make it production-ready in no time.

Features

Examples

We cover three different deployment approaches:

Documentation