noahc1510 / trt-llm-rag-linux

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Linux using TensorRT-LLM
Other
19 stars 5 forks source link

Update for TensorRT-LLM V0.9 #1

Closed engineer1109 closed 8 months ago

engineer1109 commented 8 months ago

Update for TensorRT-LLM V0.9 Update to the latest TensorRT-LLM Have tested successfully on nvidia docker cuda 12.1