Open huai-ying opened 3 months ago
TensorRT acceleration is genuinely impressive, but its concurrency is not as good as vLLM's
How about pairing it with triton-inference-server?
🚀 The feature, motivation and pitch
TensorRT acceleration is genuinely impressive, but its concurrency is not as good as vLLM's.
Alternatives
No response
Additional context
No response