z-x-yang / Segment-and-Track-Anything

An open-source project for tracking and segmenting any object in videos, either automatically or interactively. It relies primarily on the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation.
GNU Affero General Public License v3.0
2.75k stars, 332 forks

How to get semantic information about what the mask represents #162

Open derekdlkdelike opened 2 months ago

derekdlkdelike commented 2 months ago

In the WebUI, I can get a mask map for each frame of the video, but how do I get the object class that each segmented mask represents?