Open littletomatodonkey opened 7 years ago
Did anyone find a solution to this?
I don't think there's a great way of doing this at the moment. You can change the network to accept multiple images as input, but then you'd need to retrain it. I think the best solution at the moment is to track each object in the frame independently; you can change the code to feed the images as a "batch" and output independent predictions for each image in the batch. Anything else that you wanted to do would probably be a research project.
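To make the "batch" suggestion concrete, here is a minimal sketch of building one input batch from several tracked objects in the same frame. It is not the repository's code: the helper names `crop_and_resize` and `build_batch`, the `(x1, y1, x2, y2)` box format, and the default 227-pixel input size are all assumptions made for illustration. It also crops the tight box only, so any context padding the real tracker applies around the box is left out.

```python
import numpy as np

def crop_and_resize(frame, box, size=227):
    """Crop box = (x1, y1, x2, y2) from frame and resize it to size x size.

    Hypothetical helper: uses nearest-neighbour sampling so the sketch
    depends on NumPy only.
    """
    h, w = frame.shape[:2]
    x1, y1, x2, y2 = box
    # Clamp the box to the image and keep at least one pixel per side.
    x1 = int(np.clip(x1, 0, w - 1))
    y1 = int(np.clip(y1, 0, h - 1))
    x2 = int(np.clip(x2, x1 + 1, w))
    y2 = int(np.clip(y2, y1 + 1, h))
    crop = frame[y1:y2, x1:x2]
    # Nearest-neighbour index maps from output pixels to crop pixels.
    rows = np.arange(size) * crop.shape[0] // size
    cols = np.arange(size) * crop.shape[1] // size
    return crop[rows][:, cols]

def build_batch(frame, boxes, size=227):
    """Stack one crop per tracked object into an (N, size, size, C) array."""
    return np.stack([crop_and_resize(frame, b, size) for b in boxes])
```

Each row of the batch would then be handled as an independent single-object tracking problem, so the network itself does not need to be retrained; only the code that feeds it and reads its outputs changes.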
Hello, I was wondering whether the code can be changed to track multiple objects per frame. I tried modifying it to accept multiple object positions as input (on each line of groundtruth.txt I wrote 8, 16 or 24... numbers to specify several positions), but then I found that the network input is just an image cropped by the bounding box (at least as I understand it), and I cannot work out how to change that. Do you have any ideas? @davheld