When the given text_prompt is "motorbike . bicycle . the telegraph pole ", the results contain labels such as "bicyclebi, telegraph pole, bicyclebike, telegraph, bicycle motorbike, the". I'd like to get accurate categories that match the words I provide. I know that raising the threshold or adding a condition to filter the recognition results would solve this problem, but it would also result in a lot of missed boxes. How can I solve this problem?
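One workaround I can think of, sketched here only as an assumption and not as anything the model's own API provides, is to leave the threshold alone and map each predicted phrase back to the closest category from the prompt. The helper names below (`split_prompt`, `longest_common_substring`, `map_phrase_to_category`) and the `min_chars` cutoff are hypothetical, and the matching uses only Python's standard `difflib` on the phrase strings:

```python
from difflib import SequenceMatcher

# Assumption: articles carry no category information and can be ignored.
ARTICLES = {"the", "a", "an"}


def split_prompt(text_prompt):
    """Split a '.'-separated prompt into category names, dropping articles."""
    categories = []
    for chunk in text_prompt.lower().split("."):
        words = [w for w in chunk.split() if w not in ARTICLES]
        if words:
            categories.append(" ".join(words))
    return categories


def longest_common_substring(a, b):
    """Length of the longest contiguous run of characters shared by a and b."""
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    return matcher.find_longest_match(0, len(a), 0, len(b)).size


def map_phrase_to_category(phrase, categories, min_chars=3):
    """Return the best-matching prompt category for a phrase, or None.

    min_chars is a hypothetical cutoff: a phrase sharing fewer characters
    than this with every category is treated as unmatched.
    """
    phrase = phrase.lower().strip()
    best, best_len = None, 0
    for category in categories:
        overlap = longest_common_substring(phrase, category)
        if overlap > best_len:
            best, best_len = category, overlap
    return best if best_len >= min_chars else None


categories = split_prompt("motorbike . bicycle . the telegraph pole ")
for phrase in ["bicyclebi", "telegraph pole", "bicyclebike", "telegraph", "the"]:
    print(repr(phrase), "->", map_phrase_to_category(phrase, categories))
```

Boxes whose phrase maps to `None` could then be reviewed or dropped separately, so no box is lost just because its label was garbled. A phrase that mixes two categories, such as "bicycle motorbike", resolves to whichever category shares the longer run of characters, which may not be what is wanted.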