Open jackdong8588 opened 1 week ago
"woman in red dress" "man with glasses" "girl with blonde hair" "yellow car" "blue bicycle"
Can the Detic_new model adapt to the above prompt words?
Detic is not very good at those grounding expressions. Grounding DINO might be a better choice and yes, we can support those expressions, you can try!
Can you tell me what kind of prompt words should generally be filled in after the parameter --texts to accurately recognize the object? Directly filling in the name of the female lead in the movie would definitely not be recognized. Besides "girl" and "woman," can appearance descriptions be used for detection?