chufengt / ALM-pedestrian-attribute

Code for the paper "Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization", ICCV 2019, http://arxiv.org/abs/1910.04562.
Apache License 2.0
188 stars, 66 forks

Backbone changes #57

Closed: abhigoku10 closed this issue 3 years ago

abhigoku10 commented 3 years ago

@chufengt I have a few questions about the backbone:

  1. What was the reason for choosing BN-Inception? When I enable BN-Inception there is a boost in accuracy, but since it is heavier we get a drop in speed.
  2. Which backbone would be a better choice for balancing the trade-off between accuracy and speed?
  3. Which backbone is the easiest to integrate with ALM?
  4. What are the bottlenecks in the current backbone?

chufengt commented 3 years ago

  1. Just for convenience; some previous works used this backbone.
  2. I didn't try other backbones. You could look at some newer models that may achieve a better trade-off, e.g. EfficientNet?
  3. Any backbone can use ALM.
  4. I didn't catch your point, but you can refer to the original BN-Inception papers.

abhigoku10 commented 3 years ago

@chufengt Thanks for the response on 1, 2 and 4.

  1. When the backbone is changed, should we modify the loss accordingly, or does the existing ALM loss equation still work?

chufengt commented 3 years ago

just keep the loss
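
One way to see why the loss can stay unchanged: a multi-label attribute loss only looks at the per-attribute logits and the 0/1 labels, so it does not care which backbone produced the logits. Below is a minimal sketch in plain Python, assuming a sigmoid binary cross-entropy with optional per-attribute weights. The function name `weighted_bce` and the weighting scheme are illustrative assumptions, not the repository's actual loss code.

```python
import math

def weighted_bce(logits, labels, weights=None):
    """Mean sigmoid binary cross-entropy over attributes.

    logits:  raw scores, one per attribute (from any backbone)
    labels:  0/1 ground truth, one per attribute
    weights: optional per-attribute loss weights (illustrative assumption)
    """
    if weights is None:
        weights = [1.0] * len(logits)
    total = 0.0
    for z, y, w in zip(logits, labels, weights):
        # Numerically stable form of -[y*log(s) + (1-y)*log(1-s)], s = sigmoid(z)
        loss = max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))
        total += w * loss
    return total / len(logits)

# The same call works whether the logits came from BN-Inception, ResNet, etc.
logits = [2.0, -1.0, 0.0]
labels = [1, 0, 1]
loss = weighted_bce(logits, labels)
```

Only the number of outputs (one logit per attribute) has to match the label vector; the feature extractor in front of it can be swapped freely.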

abhigoku10 commented 3 years ago

@chufengt When using pretrained=True, are you freezing only the last layer and training, or retraining from the beginning?

chufengt commented 3 years ago

No freezing; the pretrained weights are used only for parameter initialization.
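
The difference between "initialization only" and "freezing" can be sketched in PyTorch as follows. This is a toy illustration under stated assumptions: `build_model` is a hypothetical stand-in for a backbone plus attribute classifier, and the "pretrained" weights are just a second randomly initialized copy, not the repository's model or checkpoint.

```python
import torch
import torch.nn as nn

def build_model(num_attributes=5):
    # Hypothetical stand-in: first layer plays the "backbone", last the "classifier".
    return nn.Sequential(
        nn.Linear(8, 16),
        nn.ReLU(),
        nn.Linear(16, num_attributes),
    )

pretrained = build_model()  # pretend these weights were trained elsewhere
model = build_model()
model.load_state_dict(pretrained.state_dict())  # used for initialization only

# Nothing is frozen: every parameter still receives gradients and is updated.
all_trainable = all(p.requires_grad for p in model.parameters())
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

# For contrast, freezing the "backbone" would look like this
# (per the reply above, this is NOT what is done here).
frozen = build_model()
for p in frozen[0].parameters():
    p.requires_grad = False
```

With initialization only, the optimizer receives all parameters and fine-tunes the whole network end to end.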

abhigoku10 commented 3 years ago

@chufengt When we change the backbone to ResNet without including CAM and STN, does it still work, given that we are providing annotations in a label format, i.e. 1/0? So the main advantage of using label-vector annotation with STN is for this reason, right?

chufengt commented 3 years ago

Using ResNet only (w/o STN) can produce a reasonable result (see https://github.com/aajinjin/Strong_Baseline_of_Pedestrian_Attribute_Recognition). Label vector annotation? It's just for convenience; it has nothing to do with STN or backbones.
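
For reference, a "label vector" here simply means one 0/1 entry per attribute for each image. A minimal sketch of that encoding in plain Python follows; the attribute names and the helpers `encode_labels` / `decode_labels` are hypothetical and only illustrate the format, not any dataset's actual attribute list.

```python
# Hypothetical attribute names, for illustration only.
ATTRIBUTES = ["hat", "backpack", "long_hair", "shorts"]

def encode_labels(present, attributes=ATTRIBUTES):
    """Return a 0/1 vector with a 1 for every attribute listed in `present`."""
    present = set(present)
    unknown = present - set(attributes)
    if unknown:
        raise ValueError(f"unknown attributes: {sorted(unknown)}")
    return [1 if name in present else 0 for name in attributes]

def decode_labels(vector, attributes=ATTRIBUTES):
    """Return the names of the attributes whose entry is 1."""
    return [name for name, flag in zip(attributes, vector) if flag == 1]

vec = encode_labels(["backpack", "shorts"])  # [0, 1, 0, 1]
```

The vector is just a compact target for the multi-label classifier, which is why it works the same way with or without STN and with any backbone.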