Closed Violettttee closed 3 weeks ago
Good point. I want to confirm if you want to only evaluate your model with single-image tasks or if you want to exclude video tasks? If it's the former, you can consider using the singe-image setting.
Our data preprocessing pipeline extracts frames from a video file automatically based on pre-defined per-model frame sampling parameters. If your model can take multiple images, it can run with the video tasks in our benchmark.
Good point. I want to confirm if you want to only evaluate your model with single-image tasks or if you want to exclude video tasks? If it's the former, you can consider using the singe-image setting.
Our data preprocessing pipeline extracts frames from a video file automatically based on pre-defined per-model frame sampling parameters. If your model can take multiple images, it can run with the video tasks in our benchmark.
Thanks for your replying~ Does that mean the "single-image" subtasks have already exclude video file?I thought it means just use one image since the "query_media" filed is a list which may contain more than one image at first. I think we just want to exclude video tasks(our model can process multiple images but we just want to evaluate images task for now~).
Good point. I want to confirm if you want to only evaluate your model with single-image tasks or if you want to exclude video tasks? If it's the former, you can consider using the singe-image setting. Our data preprocessing pipeline extracts frames from a video file automatically based on pre-defined per-model frame sampling parameters. If your model can take multiple images, it can run with the video tasks in our benchmark.
Thanks for your replying~ Does that mean the "single-image" subtasks have already exclude video file?I thought it means just use one image since the "query_media" filed is a list which may contain more than one image at first. I think we just want to exclude video tasks(our model can process multiple images but we just want to evaluate images task for now~).
The single-image subset excludes all tasks that contain multiple images or videos. Using the Core set as an example, the single-image subset contains 273 tasks out of the full 440 Core tasks. We also exclude the image input in the 1-shot demonstration example so that the entire query only contains a single image. Maybe you can try the single-image subset first?
I see. We can indeed add a flag to enable only running image tasks. Some multi-image tasks contain a list of images extracted manually from a video by the task annotator, so I guess you don't want to evaluate your model on those tasks as well?
Oh,i got the point!I have been taking the "video" task as a real video task, which is different from images tasks.If its based on pre-defined per-model frame sampling parameters,i think it just consist of multiple images and we dont need to add "video" inputs and can just follow your process! Thanks a lot for your help~
our model evaluation repo doesnt finish video inference yet, and we just want to evaluate our model on images task.Is there any additional configuration could help this?(I see the input process has dealt with video or image file.If i just add additional process on input process, could evaluation goes properly?)