Closed: BillChan226 closed this issue 2 months ago
You can specify the absolute video path with the "videos" key, as follows:
{"query": "Describe this video in detail. Don't repeat", "response": "xxxxxxxxx", "history": [], "videos": ["video_path"]}
Hi, thanks for the reply! It works! However, I'm wondering how to set the number of frames sampled from each video when fine-tuning InternVL2?
Modify num_segments in https://github.com/modelscope/swift/blob/main/swift/llm/utils/vision_utils.py#L116
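For intuition about what a segment count usually controls, here is a rough sketch of uniform segment-based frame sampling: the video is split into num_segments equal spans and one frame is taken from the middle of each. This is an assumption about the general technique, not a copy of the swift code at the link above, and `sample_frame_indices` is a hypothetical name; check the linked file for the actual behavior.

```python
def sample_frame_indices(total_frames, num_segments):
    """Pick one frame index from the middle of each of num_segments equal spans.

    Illustrative only; the real implementation in swift may differ.
    """
    if total_frames <= 0 or num_segments <= 0:
        raise ValueError("total_frames and num_segments must be positive")
    seg_size = total_frames / num_segments
    return [
        # Clamp so very short videos never index past the last frame.
        min(total_frames - 1, int(seg_size * i + seg_size / 2))
        for i in range(num_segments)
    ]
```

Under this scheme a larger num_segments means more frames per video, and so more visual tokens and memory per sample.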
Hi! Great work! I'm wondering if swift will support fine-tuning InternVL2 on a custom video-to-text dataset soon? Thanks!