Open LetheRiver0 opened 2 months ago
Hi, Amazing job for new llava-next-video model! Since it has 34B params and maybe need more than 1 GPU, so do we have support some inference accelerate method for new llava-next-video models? like sglang deploy. Thanks~
+1
Hi, Amazing job for new llava-next-video model! Since it has 34B params and maybe need more than 1 GPU, so do we have support some inference accelerate method for new llava-next-video models? like sglang deploy. Thanks~