bd-iaas-us / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
https://docs.vllm.ai
Apache License 2.0

[Feature]: TP support in QLoRA of VLLM #12

Open chenqianfzh opened 3 weeks ago

chenqianfzh commented 3 weeks ago

🚀 The feature, motivation and pitch

Support tensor parallelism (TP) when serving QLoRA (bitsandbytes-quantized) models in vLLM, so that quantized models with LoRA adapters can be sharded across multiple GPUs.
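For illustration, a sketch of how the requested combination might be invoked once supported. The individual pieces (bitsandbytes quantization, LoRA adapters, tensor parallelism) each have existing vLLM server flags; the feature request is for them to work together. Model and adapter paths below are placeholders, not part of the original request.

```shell
# Hypothetical invocation combining QLoRA (bitsandbytes) with TP.
# Assumes the flags compose once this feature lands; paths are placeholders.
python -m vllm.entrypoints.openai.api_server \
    --model huggyllama/llama-7b \
    --quantization bitsandbytes \
    --load-format bitsandbytes \
    --enable-lora \
    --lora-modules my-adapter=/path/to/qlora-adapter \
    --tensor-parallel-size 2
```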

Alternatives

No response

Additional context

No response

chenqianfzh commented 1 week ago

PR: https://github.com/vllm-project/vllm/pull/5813