z-x-yang / Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
GNU Affero General Public License v3.0
2.75k stars, 332 forks

The confidence score is reduced when there are two text prompts #153

Open vodanhday opened 4 months ago

vodanhday commented 4 months ago

Hello, I am facing an issue with the text prompt. When I input a single prompt such as "person" or "worker", the confidence score is higher than when I input two or more text prompts (e.g. "worker, box"). How can I fix that? Thank you.
