Open huliang2016 opened 1 year ago
LMDeploy is a toolkit for compressing, deploying, and serving LLMs. It also supports llama and llama-2.
I'm wondering whether it could be the fastest serving tool.
I may test it myself if I find some spare time.