walternat1ve opened this issue 3 months ago (status: Open)
Furthermore, I noticed that you use the class labels as prompts for Florence-2, right?
Another observation: results differ a lot when prompts are combined. It seems to work better to run one prompt after another and then aggregate all target classes into one result.
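To make the per-prompt idea concrete, here is a minimal sketch of what I mean. It is not the code from this repo: `run_detector` is a hypothetical callable standing in for a single Florence-2 call with one prompt, and I am assuming it returns a dict with `bboxes` and `labels` lists.

```python
from typing import Callable, Dict, List

# Assumed result shape for one prompt: {"bboxes": [[x1, y1, x2, y2], ...], "labels": [...]}
Detections = Dict[str, list]


def detect_per_class(
    run_detector: Callable[[str], Detections],
    class_names: List[str],
) -> Detections:
    """Query the model once per class name and merge everything into one result."""
    merged: Detections = {"bboxes": [], "labels": []}
    for name in class_names:
        # One prompt per call, instead of one combined prompt for all classes.
        result = run_detector(name)
        merged["bboxes"].extend(result.get("bboxes", []))
        merged["labels"].extend(result.get("labels", []))
    return merged
```

The trade-off is one model call per class instead of a single call, so it is slower, but each class gets its own uncombined prompt.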
Hi! I ran into the same problem. Check the issue I opened in the Florence-2 repo; I think it really clears things up :smiley:
I noticed that you add a bias by prefixing each prompt with "a photo of ". This is not what a typical user expects. Please remove it, as it influences the results. Thanks.
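If removing it outright is not an option, one possibility is to make the prefix opt-in. This is only a sketch of the idea; `build_prompts` and its `prefix` parameter are hypothetical names, not something that exists in this repo.

```python
from typing import List


def build_prompts(class_names: List[str], prefix: str = "") -> List[str]:
    """Turn class names into prompts; by default they are passed through unchanged."""
    # The prefix is empty by default, so nothing is added unless the user asks for it.
    return [f"{prefix}{name}" for name in class_names]
```

With this, `build_prompts(["cat", "dog"])` leaves the labels as they are, and anyone who wants the old behaviour can pass `prefix="a photo of "` explicitly.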