Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
197 stars, 31 forks
fix: pass chat_template as a dict to ChatHistory #29
Closed
ms1design closed 3 months ago
Hi @dusty-nv, this just fixes a sad undefined variable 🙌
P.S. Have you had time to experiment a bit with llama-3.1 tool calling yet? :)
Cheers