This is a PyTorch implementation of the paper "Multi-branch and Multi-scale Attention Learning for Fine-Grained Visual Categorization (MMAL-Net)" (Fan Zhang, Meng Li, Guisheng Zhai, Yizhao Liu).
Hi, thanks for your simple and efficient methods. I have some comments for your network.
1、Your classification results are based on the output of the second branch. Have you ever tried ensemble three branch. Does it improved?
2、In part branch, input size is 224 224 rather than 448448。This is tricky or not?
3、The metrics of localization iou is computed by total images or test dataset only?
Hi, thanks for your simple and efficient methods. I have some comments for your network. 1、Your classification results are based on the output of the second branch. Have you ever tried ensemble three branch. Does it improved? 2、In part branch, input size is 224 224 rather than 448448。This is tricky or not? 3、The metrics of localization iou is computed by total images or test dataset only?