BAAI-DCAI / Bunny

A family of lightweight multimodal models.
Apache License 2.0
932 stars 69 forks

Question: What is the minimum size of an image that can be classified? #128

Open ChangGiMoon opened 1 month ago

ChangGiMoon commented 1 month ago

I want to use Bunny-Llama-3-8B-V for binary classification (e.g. yes or no) on very small images. Specifically, I plan to feed cropped bounding-box patches produced by an object detector into Bunny-Llama-3-8B-V and verify whether the class assigned to each bounding box is correct. Can you tell me the approximate minimum image size that can be classified? I will use the prompt below. The input images vary in size and appearance, as shown below.

prompt: Based on the given image, answer the following question with 'yes' or 'no': Question: [Is there a person in this image?], Answer:
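For anyone reusing this prompt across several detector classes, here is a minimal sketch of how it could be templated and how the model's reply could be mapped to a boolean. The names `PROMPT_TEMPLATE`, `build_prompt`, and `parse_yes_no` are hypothetical helpers and are not part of Bunny. The parsing rule (look only at the first word of the reply) is an assumption about how the model answers, not documented behavior.

```python
from typing import Optional

# Template copied from the prompt above; only the question text varies.
PROMPT_TEMPLATE = (
    "Based on the given image, answer the following question with 'yes' or 'no': "
    "Question: [{question}], Answer:"
)


def build_prompt(question: str) -> str:
    """Fill the question into the yes/no prompt (hypothetical helper)."""
    return PROMPT_TEMPLATE.format(question=question)


def parse_yes_no(reply: str) -> Optional[bool]:
    """Map a free-text reply to True/False, or None if it is neither.

    Assumption: the answer is carried by the first word of the reply.
    """
    words = reply.strip().lower().split()
    if not words:
        return None
    first = words[0].strip(".,!?'\"")
    if first == "yes":
        return True
    if first == "no":
        return False
    return None


if __name__ == "__main__":
    print(build_prompt("Is there a person in this image?"))
    print(parse_yes_no(" Yes."))
```

Returning `None` for anything other than a clear yes or no keeps ambiguous replies separate from real rejections, which matters when the result is used to filter detections.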

input image example: [image: ex]

Isaachhh commented 1 month ago

The input image would be resized to 448x448 for Bunny-Llama-3-8B-V.
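To get a feel for what this means for small crops, the sketch below computes how much a patch is stretched on each axis when resized to 448x448. It assumes a direct resize that ignores aspect ratio. The actual preprocessing may pad or interpolate differently, so treat the numbers as a rough guide. `TARGET_SIDE` and `scale_factors` are hypothetical names used only for illustration.

```python
TARGET_SIDE = 448  # side length stated in the reply above


def scale_factors(width: int, height: int, target: int = TARGET_SIDE):
    """Return (horizontal, vertical) stretch when resizing to target x target.

    Assumption: a plain resize with no padding or aspect-ratio preservation.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    return target / width, target / height


if __name__ == "__main__":
    # Example crop sizes chosen for illustration, not taken from the issue.
    for w, h in [(16, 32), (32, 64), (112, 224)]:
        sx, sy = scale_factors(w, h)
        print(f"{w}x{h} crop: x{sx:.1f} horizontally, x{sy:.1f} vertically")
```

A large stretch factor means each source pixel is interpolated over many output pixels, so the model receives a blurry image with little real detail. Under the same assumption, unequal factors on the two axes mean the object is also distorted.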