Closed htlou closed 2 months ago
Add the Qwen2-VL model, which represents a text_image_video_to_text model.
No response
Required prerequisites
Motivation
Add the Qwen2-VL model, which represents a text_image_video_to_text model.
Solution
No response
Alternatives
No response
Additional context
No response