triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend
Apache License 2.0
714 stars 108 forks source link

Update llama.md #604

Open surprisedPikachu007 opened 2 months ago

surprisedPikachu007 commented 2 months ago

replaced --kv_cache_type paged with --paged_kv_cache enable