This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.
Hi, I want to train COCO and VOC together using the same backbone. How can I ensure that the concat dataset has separate obj detection heads catering to the different number of classes in the two datasets, 80 and 20 respectively.
Is there any better workaround?
Any help will be appreciated!
Hi, I want to train COCO and VOC together using the same backbone. How can I ensure that the concat dataset has separate obj detection heads catering to the different number of classes in the two datasets, 80 and 20 respectively. Is there any better workaround? Any help will be appreciated!
Thanks