gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
MIT License
6.51k stars 481 forks source link

Real-time capability and fine-tuning a model #115

Open Helge543 opened 1 year ago

Helge543 commented 1 year ago

Hi,

I hope this is the right place for my questions. The framework looks very promising to me and exciting but I am not 100% sure if I can use it for my project. I still have some questions that are important to me:

  1. Is it real-time capable, i.e. is it applicable to frames (every Nth frame/image) transmitted from an external camera device to my PC via Wi-Fi assuming that Wi-Fi connection is working well and "good" PC hardware is used (the graphics card I am using for example is an RTX 4070 Ti)?

  2. Can a model be fine-tuned on my own data set with several different classes and if so, exists code for training and how long does it take approximately on the already mentioned card or is it just ridiculous and far beyond it?

Thanks in advance!