Real-time capability and fine-tuning a model

Hi,

I hope this is the right place for my questions. The framework looks very promising and exciting, but I am not 100% sure whether I can use it for my project. I have two questions that are important to me:

Is it real-time capable? That is, can it be applied to frames (every Nth frame/image) streamed from an external camera to my PC over Wi-Fi, assuming the Wi-Fi connection is stable and good PC hardware is used (for example, my graphics card is an RTX 4070 Ti)?
Can a model be fine-tuned on my own dataset with several different classes? If so, is training code available, and roughly how long would training take on the card mentioned above, or is that far beyond what it can handle?
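
To make the first question concrete, here is a minimal sketch of what I mean by "real-time": if only every Nth frame is processed, each processed frame has a time budget of N divided by the camera frame rate. The function names and the numbers in the example are my own placeholders, not part of the framework, and the timing is an assumed value, not a measurement.

```python
def frame_budget_seconds(camera_fps: float, every_nth: int) -> float:
    """Time available per processed frame when only every Nth frame is used."""
    if camera_fps <= 0 or every_nth < 1:
        raise ValueError("camera_fps must be > 0 and every_nth must be >= 1")
    return every_nth / camera_fps


def keeps_up(camera_fps: float, every_nth: int, seconds_per_frame: float) -> bool:
    """True if the per-frame cost (transfer + inference) fits into the budget."""
    return seconds_per_frame <= frame_budget_seconds(camera_fps, every_nth)


if __name__ == "__main__":
    # Hypothetical numbers: 30 fps camera, every 3rd frame,
    # 0.08 s assumed total cost per processed frame.
    print(frame_budget_seconds(30.0, 3))
    print(keeps_up(30.0, 3, 0.08))
```

So what I would need to know is roughly how long one frame takes end to end on a card like mine, so I can pick N accordingly.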

Thanks in advance!

gaomingqi / Track-Anything

Real-time capability and fine-tuning a model #115