Support for multiple instances of same object class

mikkeljakobsen commented 3 years ago

Is it possible to train the network on a dataset with multiple instances of the same object class? I've tried to adapt the code to my own dataset but I'm unsure how to handle multiple instances of the same object. Best regards, Mikkel

ethnhe commented 3 years ago

It requires some modification to deal with multi instances in one scene using our full models:

For data preprocessing, the semantic label is kept still but the center and keypoint offset map should be modified. Specifically, take the center offset map as an example, it should be updated per-instance rather than per class here.
The training scripts can be kept still.
For inference:
- The PyTorch Cuda version of the MeanShift algorithm we implemented here only output one cluster center (one instance) for each class. It should be modified to output multi-instance clusters. Or you can use the skilearn version on CPU.
- After each instance cluster is obtained, the following keypoint and pose estimation procedure (After here) also need to be modified accordingly to output the pose parameter of each instance.

Another way is to utilize on-the-shelf instance segmentation/detection architecture to obtain the RoI of each instance and then cropped them out to train our pose estimation network. In this case, you can shrink the network of FFB6D, eg, replace ResNet34 with ResNet18 to speed up the network.

mikkeljakobsen commented 3 years ago

For data preprocessing, the semantic label is kept still but the center and keypoint offset map should be modified. Specifically, take the center offset map as an example, it should be updated per-instance rather than per class here.

Thanks for your answer. I tried to modify my data loader class according to your instructions. However, when I start training, I get this error:

RuntimeError: stack expects each tensor to be equal size, but got [6, 3, 4] at entry 0 and [5, 3, 4] at entry 2

I think the problem is that the number of instances differs from frame to frame, e.g. there may be 6 instances in one frame but only 5 instances in another. How do I deal with this issue?

hz-ants commented 3 years ago

Has this problem been solved?Who can share this code or more detailed suggestions?

leoflcn commented 2 years ago

I'd also like to know if this has been solved :)

nachi9211 commented 2 years ago

Hi, can someone point me towards correcting the code to output poses for all objects in the scene?

ethnhe / FFB6D

Support for multiple instances of same object class #11