gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
MIT License
6.46k stars, 480 forks

Bounding Box as input #58

Open: chris-hndz opened this issue 1 year ago

chris-hndz commented 1 year ago

Hello!

Thank you very much again for your good work with this project!

Have you considered supporting a bounding box as input on the first frame? Ideally the box would select not only what lies strictly inside it, but the whole segmentable object it indicates, even where that object extends slightly beyond the box. The SAM web demo behaves this way: when you draw a box around a head, the resulting mask also includes part of the neck outside the box.

[Image: DemoSAM_face]

Source code of the SAM web demo: https://github.com/MiscellaneousStuff/meta-sam-demo

Thanks! :D

gaomingqi commented 1 year ago

Hello, thanks for your suggestion! We currently have no plans to add box input, since point input can meet the requirements above in most cases. We will probably consider this soon, and we are happy to merge pull requests that improve TAM's interactivity via box input. Thanks!
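For anyone exploring such a pull request, here is a minimal sketch of a first-frame box prompt. It assumes the `segment_anything` package's `SamPredictor`, whose `predict` method accepts a `box` argument in XYXY pixel coordinates. The helper names `clamp_box` and `first_frame_mask_from_box` are hypothetical and are not part of Track-Anything. SAM treats the box as a hint rather than a hard crop, so the returned mask can extend past the box, which matches the behaviour described in the original post.

```python
from typing import Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels, XYXY order


def clamp_box(box: Box, width: int, height: int) -> Box:
    """Order the corners and clip the box to the frame bounds."""
    x0, y0, x1, y1 = box
    x0, x1 = sorted((x0, x1))
    y0, y1 = sorted((y0, y1))
    x0 = max(0, min(x0, width - 1))
    x1 = max(0, min(x1, width - 1))
    y0 = max(0, min(y0, height - 1))
    y1 = max(0, min(y1, height - 1))
    return (x0, y0, x1, y1)


def first_frame_mask_from_box(predictor, frame_rgb, box: Box):
    """Return a boolean HxW mask for the object indicated by `box`.

    Assumptions: `predictor` is a segment_anything.SamPredictor and
    `frame_rgb` is an HxWx3 uint8 RGB array of the first video frame.
    """
    import numpy as np  # imported lazily so clamp_box works without NumPy

    height, width = frame_rgb.shape[:2]
    predictor.set_image(frame_rgb)
    masks, scores, low_res_logits = predictor.predict(
        box=np.array(clamp_box(box, width, height)),
        multimask_output=False,  # ask for a single best mask
    )
    return masks[0]
```

The resulting mask could then play the same role as a mask produced from point clicks. How it would be wired into TAM's interface is left open here.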

licksylick commented 9 months ago

> bounding box as input over the first frame.

Hello! You can try https://github.com/licksylick/AutoTrackAnything

It creates masks automatically and uses bounding-box coordinates as inputs. Basic usage: you can detect objects and track them automatically. Hope this helps :)
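The general idea of turning detector boxes into masks for a tracker can be sketched as follows. This is only an illustration under stated assumptions, not AutoTrackAnything's actual code: `build_label_map` and `box_fill` are hypothetical names, masks are plain nested lists to keep the example dependency-free, and the stand-in segmenter simply fills each box where a real pipeline would call a segmentation model such as SAM. The output is a single label map (0 for background, 1..N for objects), a common way to hand several first-frame objects to a mask-propagation tracker.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1), inclusive pixel corners
Mask = List[List[bool]]          # height rows of width booleans


def build_label_map(
    boxes: Sequence[Box],
    segment: Callable[[Box], Mask],
    height: int,
    width: int,
) -> List[List[int]]:
    """Merge one mask per detected box into a single label map.

    Background is 0 and the i-th box gets label i + 1. Where masks
    overlap, the earlier object keeps the pixel.
    """
    labels = [[0] * width for _ in range(height)]
    for obj_id, box in enumerate(boxes, start=1):
        mask = segment(box)
        for y in range(height):
            for x in range(width):
                if mask[y][x] and labels[y][x] == 0:
                    labels[y][x] = obj_id
    return labels


def box_fill(height: int, width: int) -> Callable[[Box], Mask]:
    """Stand-in segmenter that marks every pixel inside the box.

    A real pipeline would replace this with a model that returns the
    object's actual shape for the given box.
    """
    def segment(box: Box) -> Mask:
        x0, y0, x1, y1 = box
        return [
            [x0 <= x <= x1 and y0 <= y <= y1 for x in range(width)]
            for y in range(height)
        ]
    return segment
```

For example, `build_label_map([(0, 0, 1, 1), (1, 1, 3, 2)], box_fill(3, 4), 3, 4)` labels two overlapping detections on a 3x4 frame, with the first detection keeping the shared pixel.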

chris-hndz commented 9 months ago

Hi @licksylick!

Thanks for sharing your project! Can your project track the segmentation of body parts, for example, only the head of a person (face + hair + ears)?

licksylick commented 9 months ago

> Thanks for sharing your project! Can your project track the segmentation of body parts, for example, only the head of a person (face + hair + ears)?

Of course. All you need to do is change the model (detector) to one that detects these parts (see the README).

chris-hndz commented 9 months ago

Thank you @licksylick for your reply. We will test your project to see how it behaves with object part tracking.