Closed kye9216789 closed 4 years ago
- The ResNext_x32x4d is already a very big backbone, it is almost the biggest backbone can be used by 1080Ti with Mask R-CNN.
- Enlarge the output size of Mask RoI Features will also enlarge the memory usage significantly since mask head is very costly.
Thanks for your answer.
Hello.
I am training Mask R-CNN for my own dataset using same format MS COCO Dataset. The only difference is the number of classes is 2 (means single class). I only want find a single object in one image, so I set number of gts is just 1. Image size is (600,600) and size_divizer is 32, so True image size is 608. I often encounter CUDA OOM when I use resnext 32x4 backbone.
I read https://github.com/open-mmlab/mmdetection/issues/188, and found that main factor for gpu memory is number of gt, and aspect ratio.
But I have no problem in both factors. I encounter OOM when I use images_per_gpu=2, and 1 is ok.
I modified mask head size from 28 to 128 because my object is very thin, and I needed high resolution in mask head and mask RoIExtractor. RoIExtractor out size is 64.
I don`t understand why it happens. My machine has two of GTX1080ti, with graphic memory 11G Below is my config file.