lxtGH closed this issue 3 years ago
This config should reproduce the pretrained checkpoint. What is the image-caption matching accuracy you get after the last epoch? Could you share the logs or tensorboard plot?
Here is the log file: https://github.com/lxtGH/ovr-cnn/blob/master/log.txt The Masked Language Modeling Accuracy is nearly zero.
Hi Alireza (@alirezazareian), I think I found the reason. I pretrained the model again: https://github.com/lxtGH/ovr-cnn/blob/master/log_nex.txt This time the Masked Language Modeling accuracy is normal. It seems that the pretraining stage is not very stable. I will report the fine-tuning results using this model here.
@alirezazareian Hi! I still cannot reproduce the results for the fine-tuning stage. AP50_split_seen is much lower than with your pretrained model.
@alirezazareian Hi! Could you share your pretraining log for reference? Then I could look for the difference.
Or could you describe the PyTorch version you used?
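For comparing two training logs of this kind, a small script can save some scrolling. The following is only a minimal sketch, assuming both logs use the `timestamp logger LEVEL: message` prefix that maskrcnn_benchmark prints; the helper names `normalize` and `diff_logs` are hypothetical, and the `mine` values in the demo are made up for illustration.

```python
import difflib
import re

# Matches a prefix such as "2020-08-04 11:05:18,287 maskrcnn_benchmark INFO: ".
LOG_PREFIX = re.compile(r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3} \S+ [A-Z]+: ?")


def normalize(lines):
    """Drop the timestamp/logger prefix and blank lines so only messages remain."""
    cleaned = []
    for line in lines:
        text = LOG_PREFIX.sub("", line).strip()
        if text:
            cleaned.append(text)
    return cleaned


def diff_logs(mine, reference):
    """Return unified-diff lines: '-' is only in reference, '+' is only in mine."""
    return list(
        difflib.unified_diff(
            normalize(reference),
            normalize(mine),
            fromfile="reference",
            tofile="mine",
            lineterm="",
            n=0,  # no context lines, only the differences
        )
    )


# Tiny demo; the "mine" lines are invented for illustration.
reference = [
    "2020-08-04 11:05:26,791 maskrcnn_benchmark INFO:",
    "[pip3] torch==1.4.0",
    "BASE_LR: 0.01",
]
mine = [
    "2021-01-01 10:00:00,000 maskrcnn_benchmark INFO:",
    "[pip3] torch==1.7.0",
    "BASE_LR: 0.01",
]
for line in diff_logs(mine, reference):
    print(line)
```

In practice the two lists would come from `open(path).read().splitlines()` on each log file. Because the timestamps are stripped, only real differences in environment, config values, or trainable parameter names should show up.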
This should be my training log, although I am not entirely sure as it's been a while since I trained it. Could you find any difference with your log?
2020-08-04 11:05:18,287 maskrcnn_benchmark INFO: Using 8 GPUs
2020-08-04 11:05:18,287 maskrcnn_benchmark INFO: Namespace(config_file='configs/mmss_v07.yaml', distributed=True, local_rank=0, opts=['OUTPUT_DIR', '/home/alireza/runs/vltrain/121'], skip_test=True)
2020-08-04 11:05:18,287 maskrcnn_benchmark INFO: Collecting env info (might take some time)
2020-08-04 11:05:26,791 maskrcnn_benchmark INFO:
PyTorch version: 1.0.0.dev20190328
Is debug build: No
CUDA used to build PyTorch: 10.0.130
OS: Debian GNU/Linux 9.12 (stretch)
GCC version: (Debian 6.3.0-18+deb9u1) 6.3.0 20170516
CMake version: Could not collect
Python version: 3.6
Is CUDA available: Yes
CUDA runtime version: 10.0.130
GPU models and configuration:
GPU 0: Tesla V100-SXM2-16GB
GPU 1: Tesla V100-SXM2-16GB
GPU 2: Tesla V100-SXM2-16GB
GPU 3: Tesla V100-SXM2-16GB
GPU 4: Tesla V100-SXM2-16GB
GPU 5: Tesla V100-SXM2-16GB
GPU 6: Tesla V100-SXM2-16GB
GPU 7: Tesla V100-SXM2-16GB
Nvidia driver version: 418.87.01
cuDNN version: Could not collect
Versions of relevant libraries:
[pip3] numpy==1.18.5
[pip3] torch==1.4.0
[pip3] torchvision==0.2.2
[conda] mkl 2020.1 217
[conda] pytorch 1.4.0 py3.6_cuda10.0.130_cudnn7.6.3_0 pytorch
[conda] pytorch-nightly 1.0.0.dev20190328 py3.6_cuda10.0.130_cudnn7.4.2_0 pytorch
[conda] torchvision 0.2.2 py_3 pytorch
Pillow (7.1.2)
2020-08-04 11:05:26,794 maskrcnn_benchmark INFO: Loaded configuration file configs/mmss_v07.yaml
2020-08-04 11:05:26,794 maskrcnn_benchmark INFO:
MODEL:
  META_ARCHITECTURE: "MMSS-GCNN"
  WEIGHT: "catalog://ImageNetPretrained/MSRA/R-50"
  RESNETS:
    BACKBONE_OUT_CHANNELS: 2048
  BACKBONE:
    CONV_BODY: "R-50-C5"
    FREEZE_CONV_BODY_AT: 0
  LANGUAGE_BACKBONE:
    TYPE: "BERT-Base"
    FREEZE: True
  MMSS_HEAD:
    TYPES: ("GroundingHead", "TransformerHead")
    DEFAULT_HEAD: "GroundingHead"
    TIE_VL_PROJECTION_WEIGHTS: True
    SPATIAL_DROPOUT: 100
    GROUNDING:
      LOCAL_METRIC: "dot"
      GLOBAL_METRIC: "aligned_local"
      ALIGNMENT: "softmax"
      # 100.0 for euclidean, 10.0 for dot, 0.01 for cosine
      ALIGNMENT_TEMPERATURE: 10.0
      LOSS: "cross_entropy"
      # NEGATIVE_MINING: 'random'
      # TRIPLET_MARGIN: 1.0
      ALIGN_WORDS_TO_REGIONS: True
      ALIGN_REGIONS_TO_WORDS: True
    TRANSFORMER:
      MASKED_LANGUAGE_MODELING: True
      MASKED_LANGUAGE_MODELING_VALIDATION: False
      MASKED_VISUAL_MODELING: False
      MVM_LOSS: ''
      MMM_LOSS: 'cross_entropy'
      BERT_CONFIG:
        num_hidden_layers: 6
        num_attention_heads: 8
        intermediate_size: 768
DATASETS:
  TRAIN: ("coco_captions_train",)
  TEST: ("coco_captions_val",)
  DATASET_CLASS: "COCOCaptionsDataset"
SOLVER:
  BASE_LR: 0.01
  WEIGHT_DECAY: 0.0001
  STEPS: (20000, 35000)
  MAX_ITER: 40000
  IMS_PER_BATCH: 64
  TEST_PERIOD: 1000
  CHECKPOINT_PERIOD: 1000
  LOG_PERIOD: 100
  CLIP_GRAD_NORM_AT: 5.0
  GRADIENT_ACCUMULATION_STEPS: 1
  USE_TRAIN_MODE_FOR_VALIDATION_LOSS: False
TEST:
  DO_EVAL: False
  IMS_PER_BATCH: 64
2020-08-04 11:05:26,795 maskrcnn_benchmark INFO: Running with config:
AMP_VERBOSE: False
DATALOADER:
  ASPECT_RATIO_GROUPING: True
  NUM_WORKERS: 4
  SIZE_DIVISIBILITY: 0
DATASETS:
  DATASET_ARGS:
    LOAD_EMBEDDINGS: False
  DATASET_CLASS: COCOCaptionsDataset
  TEST: ('coco_captions_val',)
  TRAIN: ('coco_captions_train',)
DTYPE: float32
INPUT:
  BRIGHTNESS: 0.0
  CONTRAST: 0.0
  HORIZONTAL_FLIP_PROB_TRAIN: 0.5
  HUE: 0.0
  MAX_SIZE_TEST: 1333
  MAX_SIZE_TRAIN: 1333
  MIN_SIZE_TEST: 800
  MIN_SIZE_TRAIN: (800,)
  PIXEL_MEAN: [102.9801, 115.9465, 122.7717]
  PIXEL_STD: [1.0, 1.0, 1.0]
  SATURATION: 0.0
  TO_BGR255: True
  VERTICAL_FLIP_PROB_TRAIN: 0.0
MODEL:
  BACKBONE:
    CONV_BODY: R-50-C5
    FREEZE_CONV_BODY_AT: 0
  CLS_AGNOSTIC_BBOX_REG: False
  DEVICE: cuda
  FBNET:
    ARCH: default
    ARCH_DEF:
    BN_TYPE: bn
    DET_HEAD_BLOCKS: []
    DET_HEAD_LAST_SCALE: 1.0
    DET_HEAD_STRIDE: 0
    DW_CONV_SKIP_BN: True
    DW_CONV_SKIP_RELU: True
    KPTS_HEAD_BLOCKS: []
    KPTS_HEAD_LAST_SCALE: 0.0
    KPTS_HEAD_STRIDE: 0
    MASK_HEAD_BLOCKS: []
    MASK_HEAD_LAST_SCALE: 0.0
    MASK_HEAD_STRIDE: 0
    RPN_BN_TYPE:
    RPN_HEAD_BLOCKS: 0
    SCALE_FACTOR: 1.0
    WIDTH_DIVISOR: 1
  FPN:
    USE_GN: False
    USE_RELU: False
  GROUP_NORM:
    DIM_PER_GP: -1
    EPSILON: 1e-05
    NUM_GROUPS: 32
  KEYPOINT_ON: False
  LANGUAGE_BACKBONE:
    ADD_POSITION_EMBEDDING: False
    EMBEDDING_PATH:
    FREEZE: True
    TYPE: BERT-Base
  LOAD_EMB_PRED_FROM_MMSS_HEAD: False
  LOAD_LAYER4_FROM_BACKBONE: False
  LOAD_TRAINER_STATE: True
  MASK_ON: False
  META_ARCHITECTURE: MMSS-GCNN
  MMSS_HEAD:
    DEFAULT_HEAD: GroundingHead
    GROUNDING:
      ALIGNMENT: softmax
      ALIGNMENT_TEMPERATURE: 10.0
      ALIGN_REGIONS_TO_WORDS: True
      ALIGN_WORDS_TO_REGIONS: True
      GLOBAL_METRIC: aligned_local
      LOCAL_METRIC: dot
      LOSS: cross_entropy
      NEGATIVE_MINING: random
      TRIPLET_MARGIN: 1.0
    SPATIAL_DROPOUT: 100
    TIE_VL_PROJECTION_WEIGHTS: True
    TRANSFORMER:
      BERT_CONFIG:
        attention_probs_dropout_prob: 0.1
        gradient_checkpointing: False
        hidden_act: gelu
        hidden_dropout_prob: 0.1
        hidden_size: 768
        initializer_range: 0.02
        intermediate_size: 768
        layer_norm_eps: 1e-12
        max_position_embeddings: 512
        num_attention_heads: 8
        num_hidden_layers: 6
        pad_token_id: 0
        type_vocab_size: 2
        vocab_size: 30522
      MASKED_LANGUAGE_MODELING: True
      MASKED_LANGUAGE_MODELING_PROB: 0.15
      MASKED_LANGUAGE_MODELING_PROB_MASK: 0.9
      MASKED_LANGUAGE_MODELING_PROB_NOISE: 0.0
      MASKED_LANGUAGE_MODELING_VALIDATION: False
      MASKED_VISUAL_MODELING: False
      MMM_LOSS: cross_entropy
      MVM_LOSS:
      MVM_LOSS_NUM_NEGATIVE: 128
    TYPES: ('GroundingHead', 'TransformerHead')
  RESNETS:
    BACKBONE_OUT_CHANNELS: 2048
    DEFORMABLE_GROUPS: 1
    NUM_GROUPS: 1
    RES2_OUT_CHANNELS: 256
    RES5_DILATION: 1
    STAGE_WITH_DCN: (False, False, False, False)
    STEM_FUNC: StemWithFixedBatchNorm
    STEM_OUT_CHANNELS: 64
    STRIDE_IN_1X1: True
    TRANS_FUNC: BottleneckWithFixedBatchNorm
    WIDTH_PER_GROUP: 64
    WITH_MODULATED_DCN: False
  RETINANET:
    ANCHOR_SIZES: (32, 64, 128, 256, 512)
    ANCHOR_STRIDES: (8, 16, 32, 64, 128)
    ASPECT_RATIOS: (0.5, 1.0, 2.0)
    BBOX_REG_BETA: 0.11
    BBOX_REG_WEIGHT: 4.0
    BG_IOU_THRESHOLD: 0.4
    FG_IOU_THRESHOLD: 0.5
    INFERENCE_TH: 0.05
    LOSS_ALPHA: 0.25
    LOSS_GAMMA: 2.0
    NMS_TH: 0.4
    NUM_CLASSES: 81
    NUM_CONVS: 4
    OCTAVE: 2.0
    PRE_NMS_TOP_N: 1000
    PRIOR_PROB: 0.01
    SCALES_PER_OCTAVE: 3
    STRADDLE_THRESH: 0
    USE_C5: True
  RETINANET_ON: False
  ROI_BOX_HEAD:
    CONV_HEAD_DIM: 256
    DILATION: 1
    EMBEDDING_BASED: False
    EMB_DIM: 300
    FEATURE_EXTRACTOR: ResNet50Conv5ROIFeatureExtractor
    FREEZE_EMB_PRED: False
    FREEZE_FEATURE_EXTRACTOR: False
    LOSS_WEIGHT_BACKGROUND: 1.0
    MLP_HEAD_DIM: 1024
    NUM_CLASSES: 81
    NUM_STACKED_CONVS: 4
    POOLER_RESOLUTION: 14
    POOLER_SAMPLING_RATIO: 0
    POOLER_SCALES: (0.0625,)
    PREDICTOR: FastRCNNPredictor
    USE_GN: False
  ROI_HEADS:
    BATCH_SIZE_PER_IMAGE: 512
    BBOX_REG_WEIGHTS: (10.0, 10.0, 5.0, 5.0)
    BG_IOU_THRESHOLD: 0.5
    DETECTIONS_PER_IMG: 100
    FG_IOU_THRESHOLD: 0.5
    NMS: 0.5
    POSITIVE_FRACTION: 0.25
    SCORE_THRESH: 0.05
    USE_FPN: False
  ROI_KEYPOINT_HEAD:
    CONV_LAYERS: (512, 512, 512, 512, 512, 512, 512, 512)
    FEATURE_EXTRACTOR: KeypointRCNNFeatureExtractor
    MLP_HEAD_DIM: 1024
    NUM_CLASSES: 17
    POOLER_RESOLUTION: 14
    POOLER_SAMPLING_RATIO: 0
    POOLER_SCALES: (0.0625,)
    PREDICTOR: KeypointRCNNPredictor
    RESOLUTION: 14
    SHARE_BOX_FEATURE_EXTRACTOR: True
  ROI_MASK_HEAD:
    CONV_LAYERS: (256, 256, 256, 256)
    DILATION: 1
    FEATURE_EXTRACTOR: ResNet50Conv5ROIFeatureExtractor
    MLP_HEAD_DIM: 1024
    POOLER_RESOLUTION: 14
    POOLER_SAMPLING_RATIO: 0
    POOLER_SCALES: (0.0625,)
    POSTPROCESS_MASKS: False
    POSTPROCESS_MASKS_THRESHOLD: 0.5
    PREDICTOR: MaskRCNNC4Predictor
    RESOLUTION: 14
    SHARE_BOX_FEATURE_EXTRACTOR: True
    USE_GN: False
  RPN:
    ANCHOR_SIZES: (32, 64, 128, 256, 512)
    ANCHOR_STRIDE: (16,)
    ASPECT_RATIOS: (0.5, 1.0, 2.0)
    BATCH_SIZE_PER_IMAGE: 256
    BG_IOU_THRESHOLD: 0.3
    FG_IOU_THRESHOLD: 0.7
    FPN_POST_NMS_PER_BATCH: True
    FPN_POST_NMS_TOP_N_TEST: 2000
    FPN_POST_NMS_TOP_N_TRAIN: 2000
    MIN_SIZE: 0
    NMS_THRESH: 0.7
    POSITIVE_FRACTION: 0.5
    POST_NMS_TOP_N_TEST: 1000
    POST_NMS_TOP_N_TRAIN: 2000
    PRE_NMS_TOP_N_TEST: 6000
    PRE_NMS_TOP_N_TRAIN: 12000
    RPN_HEAD: SingleConvRPNHead
    STRADDLE_THRESH: 0
    USE_FPN: False
  RPN_ONLY: False
  WEIGHT: catalog://ImageNetPretrained/MSRA/R-50
OUTPUT_DIR: /home/alireza/runs/vltrain/121
PATHS_CATALOG: /home/alireza/maskrcnn/maskrcnn_benchmark/config/paths_catalog.py
SOLVER:
  BASE_LR: 0.01
  BIAS_LR_FACTOR: 2
  CHECKPOINT_PERIOD: 1000
  CLIP_GRAD_NORM_AT: 5.0
  GAMMA: 0.1
  GRADIENT_ACCUMULATION_STEPS: 1
  IMS_PER_BATCH: 64
  LOG_PERIOD: 100
  MAX_ITER: 40000
  MOMENTUM: 0.9
  STEPS: (20000, 35000)
  TEST_PERIOD: 1000
  USE_TRAIN_MODE_FOR_VALIDATION_LOSS: False
  WARMUP_FACTOR: 0.3333333333333333
  WARMUP_ITERS: 500
  WARMUP_METHOD: linear
  WEIGHT_DECAY: 0.0001
  WEIGHT_DECAY_BIAS: 0
TEST:
  BBOX_AUG:
    ENABLED: False
    H_FLIP: False
    MAX_SIZE: 4000
    SCALES: ()
    SCALE_H_FLIP: False
  DETECTIONS_PER_IMG: 100
  DO_EVAL: False
  EXPECTED_RESULTS: []
  EXPECTED_RESULTS_SIGMA_TOL: 4
  IMS_PER_BATCH: 64
2020-08-04 11:05:26,795 maskrcnn_benchmark INFO: Saving config into: /home/alireza/runs/vltrain/121/config.yml
2020-08-04 11:05:32,401 maskrcnn_benchmark.make_optimizer INFO: The following parameters will be trained:
2020-08-04 11:05:32,401 maskrcnn_benchmark.make_optimizer INFO: backbone.body.stem.conv1.weight
2020-08-04 11:05:32,401 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.0.downsample.0.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.0.conv1.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.0.conv2.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.0.conv3.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.1.conv1.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.1.conv2.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.1.conv3.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.2.conv1.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.2.conv2.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer1.2.conv3.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.0.downsample.0.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.0.conv1.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.0.conv2.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.0.conv3.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.1.conv1.weight
2020-08-04 11:05:32,402 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.1.conv2.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.1.conv3.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.2.conv1.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.2.conv2.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.2.conv3.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.3.conv1.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.3.conv2.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer2.3.conv3.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.0.downsample.0.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.0.conv1.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.0.conv2.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.0.conv3.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.1.conv1.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.1.conv2.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.1.conv3.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.2.conv1.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.2.conv2.weight
2020-08-04 11:05:32,403 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.2.conv3.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.3.conv1.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.3.conv2.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.3.conv3.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.4.conv1.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.4.conv2.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.4.conv3.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.5.conv1.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.5.conv2.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer3.5.conv3.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.0.downsample.0.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.0.conv1.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.0.conv2.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.0.conv3.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.1.conv1.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.1.conv2.weight
2020-08-04 11:05:32,404 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.1.conv3.weight
2020-08-04 11:05:32,405 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.2.conv1.weight
2020-08-04 11:05:32,405 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.2.conv2.weight
2020-08-04 11:05:32,405 maskrcnn_benchmark.make_optimizer INFO: backbone.body.layer4.2.conv3.weight
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.GroundingHead.v2l_projection.weight
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.GroundingHead.v2l_projection.bias
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.visual_emb.image_embeddings.weight
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.visual_emb.image_embeddings.bias
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.visual_emb.image_location_embeddings.weight
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.visual_emb.image_location_embeddings.bias
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.visual_emb.LayerNorm.weight
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.visual_emb.LayerNorm.bias
2020-08-04 11:05:32,406 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.self.query.weight
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.self.query.bias
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.self.key.weight
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.self.key.bias
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.self.value.weight
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.self.value.bias
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.output.dense.weight
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.output.dense.bias
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.output.LayerNorm.weight
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.attention.output.LayerNorm.bias
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.intermediate.dense.weight
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.intermediate.dense.bias
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.output.dense.weight
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.output.dense.bias
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.output.LayerNorm.weight
2020-08-04 11:05:32,407 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.0.output.LayerNorm.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.self.query.weight
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.self.query.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.self.key.weight
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.self.key.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.self.value.weight
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.self.value.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.output.dense.weight
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.output.dense.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.output.LayerNorm.weight
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.attention.output.LayerNorm.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.intermediate.dense.weight
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.intermediate.dense.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.output.dense.weight
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.output.dense.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.output.LayerNorm.weight
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.1.output.LayerNorm.bias
2020-08-04 11:05:32,408 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.self.query.weight
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.self.query.bias
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.self.key.weight
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.self.key.bias
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.self.value.weight
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.self.value.bias
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.output.dense.weight
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.output.dense.bias
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.output.LayerNorm.weight
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.attention.output.LayerNorm.bias
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.intermediate.dense.weight
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.intermediate.dense.bias
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.output.dense.weight
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.output.dense.bias
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.output.LayerNorm.weight
2020-08-04 11:05:32,409 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.2.output.LayerNorm.bias
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.self.query.weight
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.self.query.bias
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.self.key.weight
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.self.key.bias
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.self.value.weight
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.self.value.bias
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.output.dense.weight
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.output.dense.bias
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.output.LayerNorm.weight
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.attention.output.LayerNorm.bias
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.intermediate.dense.weight
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.intermediate.dense.bias
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.output.dense.weight
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.output.dense.bias
2020-08-04 11:05:32,410 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.output.LayerNorm.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.3.output.LayerNorm.bias
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.self.query.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.self.query.bias
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.self.key.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.self.key.bias
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.self.value.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.self.value.bias
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.output.dense.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.output.dense.bias
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.output.LayerNorm.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.attention.output.LayerNorm.bias
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.intermediate.dense.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.intermediate.dense.bias
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.output.dense.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.output.dense.bias
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.output.LayerNorm.weight
2020-08-04 11:05:32,411 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.4.output.LayerNorm.bias
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.self.query.weight
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.self.query.bias
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.self.key.weight
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.self.key.bias
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.self.value.weight
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.self.value.bias
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.output.dense.weight
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.output.dense.bias
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.output.LayerNorm.weight
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.attention.output.LayerNorm.bias
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.intermediate.dense.weight
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.intermediate.dense.bias
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.output.dense.weight
2020-08-04 11:05:32,412 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.output.dense.bias
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.output.LayerNorm.weight
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.encoder.layer.5.output.LayerNorm.bias
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.pooler.dense.weight
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.pooler.dense.bias
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.heads.predictions.bias
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.heads.predictions.transform.dense.weight
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.heads.predictions.transform.dense.bias
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.heads.predictions.transform.LayerNorm.weight
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.heads.predictions.transform.LayerNorm.bias
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.heads.bi_seq_relationship.weight
2020-08-04 11:05:32,413 maskrcnn_benchmark.make_optimizer INFO: mmss_heads.TransformerHead.heads.bi_seq_relationship.bias
2020-08-04 11:05:32,780 maskrcnn_benchmark.utils.checkpoint INFO: Loading checkpoint from catalog://ImageNetPretrained/MSRA/R-50
2020-08-04 11:05:32,780 maskrcnn_benchmark.utils.checkpoint INFO: catalog://ImageNetPretrained/MSRA/R-50 points to https://dl.fbaipublicfiles.com/detectron/ImageNetPretrained/MSRA/R-50.pkl
2020-08-04 11:05:33,118 maskrcnn_benchmark.utils.checkpoint INFO: url https://dl.fbaipublicfiles.com/detectron/ImageNetPretrained/MSRA/R-50.pkl cached in /home/alireza/.torch/models/R-50.pkl
2020-08-04 11:05:33,398 maskrcnn_benchmark.utils.c2_model_loading INFO: Remapping C2 weights
2020-08-04 11:05:33,398 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: conv1_b mapped name: conv1.bias
2020-08-04 11:05:33,398 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: conv1_w mapped name: conv1.weight
2020-08-04 11:05:33,398 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: fc1000_b mapped name: fc1000.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: fc1000_w mapped name: fc1000.weight
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch1_b mapped name: layer1.0.downsample.0.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch1_bn_b mapped name: layer1.0.downsample.1.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch1_bn_s mapped name: layer1.0.downsample.1.weight
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch1_w mapped name: layer1.0.downsample.0.weight
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2a_b mapped name: layer1.0.conv1.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2a_bn_b mapped name: layer1.0.bn1.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2a_bn_s mapped name: layer1.0.bn1.weight
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2a_w mapped name: layer1.0.conv1.weight
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2b_b mapped name: layer1.0.conv2.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2b_bn_b mapped name: layer1.0.bn2.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2b_bn_s mapped name: layer1.0.bn2.weight
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2b_w mapped name: layer1.0.conv2.weight
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2c_b mapped name: layer1.0.conv3.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2c_bn_b mapped name: layer1.0.bn3.bias
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2c_bn_s mapped name: layer1.0.bn3.weight
2020-08-04 11:05:33,399 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_0_branch2c_w mapped name: layer1.0.conv3.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2a_b mapped name: layer1.1.conv1.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2a_bn_b mapped name: layer1.1.bn1.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2a_bn_s mapped name: layer1.1.bn1.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2a_w mapped name: layer1.1.conv1.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2b_b mapped name: layer1.1.conv2.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2b_bn_b mapped name: layer1.1.bn2.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2b_bn_s mapped name: layer1.1.bn2.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2b_w mapped name: layer1.1.conv2.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2c_b mapped name: layer1.1.conv3.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2c_bn_b mapped name: layer1.1.bn3.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2c_bn_s mapped name: layer1.1.bn3.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_1_branch2c_w mapped name: layer1.1.conv3.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2a_b mapped name: layer1.2.conv1.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2a_bn_b mapped name: layer1.2.bn1.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2a_bn_s mapped name: layer1.2.bn1.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2a_w mapped name: layer1.2.conv1.weight
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2b_b mapped name: layer1.2.conv2.bias
2020-08-04 11:05:33,400 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2b_bn_b mapped name: layer1.2.bn2.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2b_bn_s mapped name: layer1.2.bn2.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2b_w mapped name: layer1.2.conv2.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2c_b mapped name: layer1.2.conv3.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2c_bn_b mapped name: layer1.2.bn3.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2c_bn_s mapped name: layer1.2.bn3.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res2_2_branch2c_w mapped name: layer1.2.conv3.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch1_b mapped name: layer2.0.downsample.0.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch1_bn_b mapped name: layer2.0.downsample.1.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch1_bn_s mapped name: layer2.0.downsample.1.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch1_w mapped name: layer2.0.downsample.0.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2a_b mapped name: layer2.0.conv1.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2a_bn_b mapped name: layer2.0.bn1.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2a_bn_s mapped name: layer2.0.bn1.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2a_w mapped name: layer2.0.conv1.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2b_b mapped name: layer2.0.conv2.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2b_bn_b mapped name: layer2.0.bn2.bias
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2b_bn_s mapped name: layer2.0.bn2.weight
2020-08-04 11:05:33,401 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2b_w mapped name: layer2.0.conv2.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2c_b mapped name: layer2.0.conv3.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2c_bn_b mapped name: layer2.0.bn3.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2c_bn_s mapped name: layer2.0.bn3.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_0_branch2c_w mapped name: layer2.0.conv3.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2a_b mapped name: layer2.1.conv1.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2a_bn_b mapped name: layer2.1.bn1.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2a_bn_s mapped name: layer2.1.bn1.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2a_w mapped name: layer2.1.conv1.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2b_b mapped name: layer2.1.conv2.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2b_bn_b mapped name: layer2.1.bn2.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2b_bn_s mapped name: layer2.1.bn2.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2b_w mapped name: layer2.1.conv2.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2c_b mapped name: layer2.1.conv3.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2c_bn_b mapped name: layer2.1.bn3.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2c_bn_s mapped name: layer2.1.bn3.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_1_branch2c_w mapped name: layer2.1.conv3.weight
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2a_b mapped name: layer2.2.conv1.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2a_bn_b mapped name: layer2.2.bn1.bias
2020-08-04 11:05:33,402 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2a_bn_s mapped name: layer2.2.bn1.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2a_w mapped name: layer2.2.conv1.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2b_b mapped name: layer2.2.conv2.bias
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2b_bn_b mapped name: layer2.2.bn2.bias
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2b_bn_s mapped name: layer2.2.bn2.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2b_w mapped name: layer2.2.conv2.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2c_b mapped name: layer2.2.conv3.bias
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2c_bn_b mapped name: layer2.2.bn3.bias
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2c_bn_s mapped name: layer2.2.bn3.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_2_branch2c_w mapped name: layer2.2.conv3.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2a_b mapped name: layer2.3.conv1.bias
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2a_bn_b mapped name: layer2.3.bn1.bias
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2a_bn_s mapped name: layer2.3.bn1.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2a_w mapped name: layer2.3.conv1.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2b_b mapped name: layer2.3.conv2.bias
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2b_bn_b mapped name: layer2.3.bn2.bias
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2b_bn_s mapped name: layer2.3.bn2.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2b_w mapped name: layer2.3.conv2.weight
2020-08-04 11:05:33,403 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2c_b mapped name: layer2.3.conv3.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2c_bn_b mapped name: layer2.3.bn3.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2c_bn_s mapped name: layer2.3.bn3.weight
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res3_3_branch2c_w mapped name: layer2.3.conv3.weight
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch1_b mapped name: layer3.0.downsample.0.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch1_bn_b mapped name: layer3.0.downsample.1.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch1_bn_s mapped name: layer3.0.downsample.1.weight
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch1_w mapped name: layer3.0.downsample.0.weight
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2a_b mapped name: layer3.0.conv1.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2a_bn_b mapped name: layer3.0.bn1.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2a_bn_s mapped name: layer3.0.bn1.weight
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2a_w mapped name: layer3.0.conv1.weight
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2b_b mapped name: layer3.0.conv2.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2b_bn_b mapped name: layer3.0.bn2.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2b_bn_s mapped name: layer3.0.bn2.weight
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2b_w mapped name: layer3.0.conv2.weight
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2c_b mapped name: layer3.0.conv3.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2c_bn_b mapped name: layer3.0.bn3.bias
2020-08-04 11:05:33,404 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2c_bn_s mapped name: layer3.0.bn3.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_0_branch2c_w mapped name: layer3.0.conv3.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2a_b mapped name: layer3.1.conv1.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2a_bn_b mapped name: layer3.1.bn1.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2a_bn_s mapped name: layer3.1.bn1.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2a_w mapped name: layer3.1.conv1.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2b_b mapped name: layer3.1.conv2.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2b_bn_b mapped name: layer3.1.bn2.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2b_bn_s mapped name: layer3.1.bn2.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2b_w mapped name: layer3.1.conv2.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2c_b mapped name: layer3.1.conv3.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2c_bn_b mapped name: layer3.1.bn3.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2c_bn_s mapped name: layer3.1.bn3.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_1_branch2c_w mapped name: layer3.1.conv3.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2a_b mapped name: layer3.2.conv1.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2a_bn_b mapped name: layer3.2.bn1.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2a_bn_s mapped name: layer3.2.bn1.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2a_w mapped name: layer3.2.conv1.weight
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2b_b mapped name: layer3.2.conv2.bias
2020-08-04 11:05:33,405 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2b_bn_b mapped name: layer3.2.bn2.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2b_bn_s mapped name: layer3.2.bn2.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2b_w mapped name: layer3.2.conv2.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2c_b mapped name: layer3.2.conv3.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2c_bn_b mapped name: layer3.2.bn3.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2c_bn_s mapped name: layer3.2.bn3.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_2_branch2c_w mapped name: layer3.2.conv3.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2a_b mapped name: layer3.3.conv1.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2a_bn_b mapped name: layer3.3.bn1.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2a_bn_s mapped name: layer3.3.bn1.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2a_w mapped name: layer3.3.conv1.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2b_b mapped name: layer3.3.conv2.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2b_bn_b mapped name: layer3.3.bn2.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2b_bn_s mapped name: layer3.3.bn2.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2b_w mapped name: layer3.3.conv2.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2c_b mapped name: layer3.3.conv3.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2c_bn_b mapped name: layer3.3.bn3.bias
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2c_bn_s mapped name: layer3.3.bn3.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_3_branch2c_w mapped name: layer3.3.conv3.weight
2020-08-04 11:05:33,406 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2a_b mapped name: layer3.4.conv1.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2a_bn_b mapped name: layer3.4.bn1.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2a_bn_s mapped name: layer3.4.bn1.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2a_w mapped name: layer3.4.conv1.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2b_b mapped name: layer3.4.conv2.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2b_bn_b mapped name: layer3.4.bn2.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2b_bn_s mapped name: layer3.4.bn2.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2b_w mapped name: layer3.4.conv2.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2c_b mapped name: layer3.4.conv3.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2c_bn_b mapped name: layer3.4.bn3.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2c_bn_s mapped name: layer3.4.bn3.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_4_branch2c_w mapped name: layer3.4.conv3.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2a_b mapped name: layer3.5.conv1.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2a_bn_b mapped name: layer3.5.bn1.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2a_bn_s mapped name: layer3.5.bn1.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2a_w mapped name: layer3.5.conv1.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2b_b mapped name: layer3.5.conv2.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2b_bn_b mapped name: layer3.5.bn2.bias
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2b_bn_s mapped name: layer3.5.bn2.weight
2020-08-04 11:05:33,407 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2b_w mapped name: layer3.5.conv2.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2c_b mapped name: layer3.5.conv3.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2c_bn_b mapped name: layer3.5.bn3.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2c_bn_s mapped name: layer3.5.bn3.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res4_5_branch2c_w mapped name: layer3.5.conv3.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch1_b mapped name: layer4.0.downsample.0.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch1_bn_b mapped name: layer4.0.downsample.1.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch1_bn_s mapped name: layer4.0.downsample.1.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch1_w mapped name: layer4.0.downsample.0.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2a_b mapped name: layer4.0.conv1.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2a_bn_b mapped name: layer4.0.bn1.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2a_bn_s mapped name: layer4.0.bn1.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2a_w mapped name: layer4.0.conv1.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2b_b mapped name: layer4.0.conv2.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2b_bn_b mapped name: layer4.0.bn2.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2b_bn_s mapped name: layer4.0.bn2.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2b_w mapped name: layer4.0.conv2.weight
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2c_b mapped name: layer4.0.conv3.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2c_bn_b mapped name: layer4.0.bn3.bias
2020-08-04 11:05:33,408 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2c_bn_s mapped name: layer4.0.bn3.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_0_branch2c_w mapped name: layer4.0.conv3.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2a_b mapped name: layer4.1.conv1.bias
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2a_bn_b mapped name: layer4.1.bn1.bias
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2a_bn_s mapped name: layer4.1.bn1.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2a_w mapped name: layer4.1.conv1.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2b_b mapped name: layer4.1.conv2.bias
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2b_bn_b mapped name: layer4.1.bn2.bias
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2b_bn_s mapped name: layer4.1.bn2.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2b_w mapped name: layer4.1.conv2.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2c_b mapped name: layer4.1.conv3.bias
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2c_bn_b mapped name: layer4.1.bn3.bias
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2c_bn_s mapped name: layer4.1.bn3.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_1_branch2c_w mapped name: layer4.1.conv3.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2a_b mapped name: layer4.2.conv1.bias
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2a_bn_b mapped name: layer4.2.bn1.bias
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2a_bn_s mapped name: layer4.2.bn1.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2a_w mapped name: layer4.2.conv1.weight
2020-08-04 11:05:33,409 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2b_b mapped name: layer4.2.conv2.bias
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2b_bn_b mapped name: layer4.2.bn2.bias
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2b_bn_s mapped name: layer4.2.bn2.weight
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2b_w mapped name: layer4.2.conv2.weight
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2c_b mapped name: layer4.2.conv3.bias
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2c_bn_b mapped name: layer4.2.bn3.bias
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2c_bn_s mapped name: layer4.2.bn3.weight
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res5_2_branch2c_w mapped name: layer4.2.conv3.weight
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res_conv1_bn_b mapped name: bn1.bias
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: C2 name: res_conv1_bn_s mapped name: bn1.weight
2020-08-04 11:05:33,410 maskrcnn_benchmark.utils.c2_model_loading INFO: Remapping conv weights for deformable conv weights
2020-08-04 11:05:33,877 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.bn1.bias loaded from layer1.0.bn1.bias of shape (64,)
2020-08-04 11:05:33,892 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.bn1.weight loaded from layer1.0.bn1.weight of shape (64,)
2020-08-04 11:05:33,892 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.bn2.bias loaded from layer1.0.bn2.bias of shape (64,)
2020-08-04 11:05:33,892 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.bn2.weight loaded from layer1.0.bn2.weight of shape (64,)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.bn3.bias loaded from layer1.0.bn3.bias of shape (256,)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.bn3.weight loaded from layer1.0.bn3.weight of shape (256,)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.conv1.weight loaded from layer1.0.conv1.weight of shape (64, 64, 1, 1)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.conv2.weight loaded from layer1.0.conv2.weight of shape (64, 64, 3, 3)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.conv3.weight loaded from layer1.0.conv3.weight of shape (256, 64, 1, 1)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.downsample.0.weight loaded from layer1.0.downsample.0.weight of shape (256, 64, 1, 1)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.downsample.1.bias loaded from layer1.0.downsample.1.bias of shape (256,)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.0.downsample.1.weight loaded from layer1.0.downsample.1.weight of shape (256,)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.bn1.bias loaded from layer1.1.bn1.bias of shape (64,)
2020-08-04 11:05:33,893 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.bn1.weight loaded from layer1.1.bn1.weight of shape (64,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.bn2.bias loaded from layer1.1.bn2.bias of shape (64,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.bn2.weight loaded from layer1.1.bn2.weight of shape (64,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.bn3.bias loaded from layer1.1.bn3.bias of shape (256,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.bn3.weight loaded from layer1.1.bn3.weight of shape (256,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.conv1.weight loaded from layer1.1.conv1.weight of shape (64, 256, 1, 1)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.conv2.weight loaded from layer1.1.conv2.weight of shape (64, 64, 3, 3)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.1.conv3.weight loaded from layer1.1.conv3.weight of shape (256, 64, 1, 1)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.bn1.bias loaded from layer1.2.bn1.bias of shape (64,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.bn1.weight loaded from layer1.2.bn1.weight of shape (64,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.bn2.bias loaded from layer1.2.bn2.bias of shape (64,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.bn2.weight loaded from layer1.2.bn2.weight of shape (64,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.bn3.bias loaded from layer1.2.bn3.bias of shape (256,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.bn3.weight loaded from layer1.2.bn3.weight of shape (256,)
2020-08-04 11:05:33,894 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.conv1.weight loaded from layer1.2.conv1.weight of shape (64, 256, 1, 1)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.conv2.weight loaded from layer1.2.conv2.weight of shape (64, 64, 3, 3)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer1.2.conv3.weight loaded from layer1.2.conv3.weight of shape (256, 64, 1, 1)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.bn1.bias loaded from layer2.0.bn1.bias of shape (128,)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.bn1.weight loaded from layer2.0.bn1.weight of shape (128,)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.bn2.bias loaded from layer2.0.bn2.bias of shape (128,)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.bn2.weight loaded from layer2.0.bn2.weight of shape (128,)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.bn3.bias loaded from layer2.0.bn3.bias of shape (512,)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.bn3.weight loaded from layer2.0.bn3.weight of shape (512,)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.conv1.weight loaded from layer2.0.conv1.weight of shape (128, 256, 1, 1)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.conv2.weight loaded from layer2.0.conv2.weight of shape (128, 128, 3, 3)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.conv3.weight loaded from layer2.0.conv3.weight of shape (512, 128, 1, 1)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.downsample.0.weight loaded from layer2.0.downsample.0.weight of shape (512, 256, 1, 1)
2020-08-04 11:05:33,895 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.downsample.1.bias loaded from layer2.0.downsample.1.bias of shape (512,)
2020-08-04 11:05:33,896 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.0.downsample.1.weight loaded from layer2.0.downsample.1.weight of shape (512,)
2020-08-04 11:05:33,896 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.bn1.bias loaded from layer2.1.bn1.bias of shape (128,)
2020-08-04 11:05:33,896 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.bn1.weight loaded from layer2.1.bn1.weight of shape (128,)
2020-08-04 11:05:33,896 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.bn2.bias loaded from layer2.1.bn2.bias of shape (128,)
2020-08-04 11:05:33,896 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.bn2.weight loaded from layer2.1.bn2.weight of shape (128,)
2020-08-04 11:05:33,896 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.bn3.bias loaded from layer2.1.bn3.bias of shape (512,)
2020-08-04 11:05:33,896 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.bn3.weight loaded from layer2.1.bn3.weight of shape (512,)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.conv1.weight loaded from layer2.1.conv1.weight of shape (128, 512, 1, 1)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.conv2.weight loaded from layer2.1.conv2.weight of shape (128, 128, 3, 3)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.1.conv3.weight loaded from layer2.1.conv3.weight of shape (512, 128, 1, 1)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.bn1.bias loaded from layer2.2.bn1.bias of shape (128,)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.bn1.weight loaded from layer2.2.bn1.weight of shape (128,)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.bn2.bias loaded from layer2.2.bn2.bias of shape (128,)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.bn2.weight loaded from layer2.2.bn2.weight of shape (128,)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.bn3.bias loaded from layer2.2.bn3.bias of shape (512,)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.bn3.weight loaded from layer2.2.bn3.weight of shape (512,)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.conv1.weight loaded from layer2.2.conv1.weight of shape (128, 512, 1, 1)
2020-08-04 11:05:33,917 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.conv2.weight loaded from layer2.2.conv2.weight of shape (128, 128, 3, 3)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.2.conv3.weight loaded from layer2.2.conv3.weight of shape (512, 128, 1, 1)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.bn1.bias loaded from layer2.3.bn1.bias of shape (128,)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.bn1.weight loaded from layer2.3.bn1.weight of shape (128,)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.bn2.bias loaded from layer2.3.bn2.bias of shape (128,)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.bn2.weight loaded from layer2.3.bn2.weight of shape (128,)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.bn3.bias loaded from layer2.3.bn3.bias of shape (512,)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.bn3.weight loaded from layer2.3.bn3.weight of shape (512,)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.conv1.weight loaded from layer2.3.conv1.weight of shape (128, 512, 1, 1)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.conv2.weight loaded from layer2.3.conv2.weight of shape (128, 128, 3, 3)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer2.3.conv3.weight loaded from layer2.3.conv3.weight of shape (512, 128, 1, 1)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.bn1.bias loaded from layer3.0.bn1.bias of shape (256,)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.bn1.weight loaded from layer3.0.bn1.weight of shape (256,)
2020-08-04 11:05:33,925 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.bn2.bias loaded from layer3.0.bn2.bias of shape (256,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.bn2.weight loaded from layer3.0.bn2.weight of shape (256,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.bn3.bias loaded from layer3.0.bn3.bias of shape (1024,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.bn3.weight loaded from layer3.0.bn3.weight of shape (1024,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.conv1.weight loaded from layer3.0.conv1.weight of shape (256, 512, 1, 1)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.conv2.weight loaded from layer3.0.conv2.weight of shape (256, 256, 3, 3)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.conv3.weight loaded from layer3.0.conv3.weight of shape (1024, 256, 1, 1)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.downsample.0.weight loaded from layer3.0.downsample.0.weight of shape (1024, 512, 1, 1)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.downsample.1.bias loaded from layer3.0.downsample.1.bias of shape (1024,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.0.downsample.1.weight loaded from layer3.0.downsample.1.weight of shape (1024,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.bn1.bias loaded from layer3.1.bn1.bias of shape (256,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.bn1.weight loaded from layer3.1.bn1.weight of shape (256,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.bn2.bias loaded from layer3.1.bn2.bias of shape (256,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.bn2.weight loaded from layer3.1.bn2.weight of shape (256,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.bn3.bias loaded from layer3.1.bn3.bias of shape (1024,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.bn3.weight loaded from layer3.1.bn3.weight of shape (1024,)
2020-08-04 11:05:33,926 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.conv1.weight loaded from layer3.1.conv1.weight of shape (256, 1024, 1, 1)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.conv2.weight loaded from layer3.1.conv2.weight of shape (256, 256, 3, 3)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.1.conv3.weight loaded from layer3.1.conv3.weight of shape (1024, 256, 1, 1)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.bn1.bias loaded from layer3.2.bn1.bias of shape (256,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.bn1.weight loaded from layer3.2.bn1.weight of shape (256,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.bn2.bias loaded from layer3.2.bn2.bias of shape (256,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.bn2.weight loaded from layer3.2.bn2.weight of shape (256,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.bn3.bias loaded from layer3.2.bn3.bias of shape (1024,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.bn3.weight loaded from layer3.2.bn3.weight of shape (1024,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.conv1.weight loaded from layer3.2.conv1.weight of shape (256, 1024, 1, 1)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.conv2.weight loaded from layer3.2.conv2.weight of shape (256, 256, 3, 3)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.2.conv3.weight loaded from layer3.2.conv3.weight of shape (1024, 256, 1, 1)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.bn1.bias loaded from layer3.3.bn1.bias of shape (256,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.bn1.weight loaded from layer3.3.bn1.weight of shape (256,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.bn2.bias loaded from layer3.3.bn2.bias of shape (256,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.bn2.weight loaded from layer3.3.bn2.weight of shape (256,)
2020-08-04 11:05:33,927 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.bn3.bias loaded from layer3.3.bn3.bias of shape (1024,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.bn3.weight loaded from layer3.3.bn3.weight of shape (1024,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.conv1.weight loaded from layer3.3.conv1.weight of shape (256, 1024, 1, 1)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.conv2.weight loaded from layer3.3.conv2.weight of shape (256, 256, 3, 3)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.3.conv3.weight loaded from layer3.3.conv3.weight of shape (1024, 256, 1, 1)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.bn1.bias loaded from layer3.4.bn1.bias of shape (256,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.bn1.weight loaded from layer3.4.bn1.weight of shape (256,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.bn2.bias loaded from layer3.4.bn2.bias of shape (256,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.bn2.weight loaded from layer3.4.bn2.weight of shape (256,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.bn3.bias loaded from layer3.4.bn3.bias of shape (1024,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.bn3.weight loaded from layer3.4.bn3.weight of shape (1024,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.conv1.weight loaded from layer3.4.conv1.weight of shape (256, 1024, 1, 1)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.conv2.weight loaded from layer3.4.conv2.weight of shape (256, 256, 3, 3)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.4.conv3.weight loaded from layer3.4.conv3.weight of shape (1024, 256, 1, 1)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.bn1.bias loaded from layer3.5.bn1.bias of shape (256,)
2020-08-04 11:05:33,928 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.bn1.weight loaded from layer3.5.bn1.weight of shape (256,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.bn2.bias loaded from layer3.5.bn2.bias of shape (256,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.bn2.weight loaded from layer3.5.bn2.weight of shape (256,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.bn3.bias loaded from layer3.5.bn3.bias of shape (1024,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.bn3.weight loaded from layer3.5.bn3.weight of shape (1024,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.conv1.weight loaded from layer3.5.conv1.weight of shape (256, 1024, 1, 1)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.conv2.weight loaded from layer3.5.conv2.weight of shape (256, 256, 3, 3)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer3.5.conv3.weight loaded from layer3.5.conv3.weight of shape (1024, 256, 1, 1)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.bn1.bias loaded from layer4.0.bn1.bias of shape (512,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.bn1.weight loaded from layer4.0.bn1.weight of shape (512,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.bn2.bias loaded from layer4.0.bn2.bias of shape (512,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.bn2.weight loaded from layer4.0.bn2.weight of shape (512,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.bn3.bias loaded from layer4.0.bn3.bias of shape (2048,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.bn3.weight loaded from layer4.0.bn3.weight of shape (2048,)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.conv1.weight loaded from layer4.0.conv1.weight of shape (512, 1024, 1, 1)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.conv2.weight loaded from layer4.0.conv2.weight of shape (512, 512, 3, 3)
2020-08-04 11:05:33,929 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.conv3.weight loaded from layer4.0.conv3.weight of shape (2048, 512, 1, 1)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.downsample.0.weight loaded from layer4.0.downsample.0.weight of shape (2048, 1024, 1, 1)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.downsample.1.bias loaded from layer4.0.downsample.1.bias of shape (2048,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.0.downsample.1.weight loaded from layer4.0.downsample.1.weight of shape (2048,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.bn1.bias loaded from layer4.1.bn1.bias of shape (512,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.bn1.weight loaded from layer4.1.bn1.weight of shape (512,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.bn2.bias loaded from layer4.1.bn2.bias of shape (512,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.bn2.weight loaded from layer4.1.bn2.weight of shape (512,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.bn3.bias loaded from layer4.1.bn3.bias of shape (2048,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.bn3.weight loaded from layer4.1.bn3.weight of shape (2048,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.conv1.weight loaded from layer4.1.conv1.weight of shape (512, 2048, 1, 1)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.conv2.weight loaded from layer4.1.conv2.weight of shape (512, 512, 3, 3)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.1.conv3.weight loaded from layer4.1.conv3.weight of shape (2048, 512, 1, 1)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.bn1.bias loaded from layer4.2.bn1.bias of shape (512,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.bn1.weight loaded from layer4.2.bn1.weight of shape (512,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.bn2.bias loaded from layer4.2.bn2.bias of shape (512,)
2020-08-04 11:05:33,930 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.bn2.weight loaded from layer4.2.bn2.weight of shape (512,)
2020-08-04 11:05:33,931 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.bn3.bias loaded from layer4.2.bn3.bias of shape (2048,)
2020-08-04 11:05:33,931 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.bn3.weight loaded from layer4.2.bn3.weight of shape (2048,)
2020-08-04 11:05:33,931 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.conv1.weight loaded from layer4.2.conv1.weight of shape (512, 2048, 1, 1)
2020-08-04 11:05:33,931 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.conv2.weight loaded from layer4.2.conv2.weight of shape (512, 512, 3, 3)
2020-08-04 11:05:33,931 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.layer4.2.conv3.weight loaded from layer4.2.conv3.weight of shape (2048, 512, 1, 1)
2020-08-04 11:05:33,931 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.stem.bn1.bias loaded from bn1.bias of shape (64,)
2020-08-04 11:05:33,931 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.stem.bn1.weight loaded from bn1.weight of shape (64,)
2020-08-04 11:05:33,931 maskrcnn_benchmark.utils.model_serialization INFO: module.backbone.body.stem.conv1.weight loaded from conv1.weight of shape (64, 3, 7, 7)
2020-08-04 11:05:34,060 maskrcnn_benchmark.data.build WARNING: When using more than one image per GPU you may encounter an out-of-memory (OOM) error if your GPU does not have sufficient memory. If this happens, you can reduce SOLVER.IMS_PER_BATCH (for training) or TEST.IMS_PER_BATCH (for inference). For training, you must also adjust the learning rate and schedule length according to the linear scaling rule. See for example: https://github.com/facebookresearch/Detectron/blob/master/configs/getting_started/tutorial_1gpu_e2e_faster_rcnn_R-50-FPN.yaml#L14
2020-08-04 11:05:37,189 maskrcnn_benchmark.utils.miscellaneous WARNING: Dataset [COCOCaptionsDataset] has no categories attribute, labels.json file won't be created
2020-08-04 11:05:37,419 maskrcnn_benchmark.data.build WARNING: When using more than one image per GPU you may encounter an out-of-memory (OOM) error if your GPU does not have sufficient memory. If this happens, you can reduce SOLVER.IMS_PER_BATCH (for training) or TEST.IMS_PER_BATCH (for inference). For training, you must also adjust the learning rate and schedule length according to the linear scaling rule. See for example: https://github.com/facebookresearch/Detectron/blob/master/configs/getting_started/tutorial_1gpu_e2e_faster_rcnn_R-50-FPN.yaml#L14
2020-08-04 11:05:37,489 maskrcnn_benchmark.trainer INFO: Start training
2020-08-04 11:06:49,898 maskrcnn_benchmark.trainer INFO: eta: 8:01:27 iter: 100 Cross-Entropy Loss (Align Regions, Choose Caption): 1.6825 (1.9629) Cross-Entropy Loss (Align Regions, Choose Image): 1.6405 (1.9497) Cross-Entropy Loss (Align Words, Choose Caption): 1.8226 (2.0018) Cross-Entropy Loss (Align Words, Choose Image): 1.7348 (1.9750) Image Caption Matching Loss: 4.1483 (4.1557) Masked Language Modeling Loss: 4.6826 (5.5908) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 15.8034 (17.6360) Batch Accuracy (Align Regions, Choose Caption): 0.3750 (0.2195) Batch Accuracy (Align Regions, Choose Image): 0.3906 (0.2242) Batch Accuracy (Align Words, Choose Caption): 0.3438 (0.2080) Batch Accuracy (Align Words, Choose Image): 0.3750 (0.2255) Batch Accuracy (Choose Caption): 0.1562 (0.1300) Batch Accuracy (Choose Image): 0.1250 (0.1322) Masked Language Modeling Accuracy: 0.3336 (0.2439) time: 0.6759 (0.7240) data: 0.0395 (0.0535) lr: 0.004653 max mem: 10750
2020-08-04 11:07:58,221 maskrcnn_benchmark.trainer INFO: eta: 7:46:43 iter: 200 Cross-Entropy Loss (Align Regions, Choose Caption): 1.0699 (1.5826) Cross-Entropy Loss (Align Regions, Choose Image): 0.9873 (1.5649) Cross-Entropy Loss (Align Words, Choose Caption): 1.2449 (1.7080) Cross-Entropy Loss (Align Words, Choose Image): 1.0132 (1.5988) Image Caption Matching Loss: 4.0515 (4.1294) Masked Language Modeling Loss: 4.1125 (4.9744) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 12.5578 (15.5581) Batch Accuracy (Align Regions, Choose Caption): 0.6406 (0.3958) Batch Accuracy (Align Regions, Choose Image): 0.6250 (0.3980) Batch Accuracy (Align Words, Choose Caption): 0.5781 (0.3534) Batch Accuracy (Align Words, Choose Image): 0.6562 (0.3999) Batch Accuracy (Choose Caption): 0.2500 (0.1593) Batch Accuracy (Choose Image): 0.2188 (0.1584) Masked Language Modeling Accuracy: 0.3844 (0.3000) time: 0.6827 (0.7036) data: 0.0383 (0.0463) lr: 0.005987 max mem: 10750
2020-08-04 11:09:06,653 maskrcnn_benchmark.trainer INFO: eta: 7:41:18 iter: 300 Cross-Entropy Loss (Align Regions, Choose Caption): 0.9103 (1.3632) Cross-Entropy Loss (Align Regions, Choose Image): 0.8133 (1.3498) Cross-Entropy Loss (Align Words, Choose Caption): 1.0594 (1.5118) Cross-Entropy Loss (Align Words, Choose Image): 0.8226 (1.3664) Image Caption Matching Loss: 3.7361 (4.0607) Masked Language Modeling Loss: 3.8365 (4.6433) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 11.2430 (14.2952) Batch Accuracy (Align Regions, Choose Caption): 0.7188 (0.4904) Batch Accuracy (Align Regions, Choose Image): 0.7031 (0.4920) Batch Accuracy (Align Words, Choose Caption): 0.6406 (0.4429) Batch Accuracy (Align Words, Choose Image): 0.7188 (0.4990) Batch Accuracy (Choose Caption): 0.2344 (0.1877) Batch Accuracy (Choose Image): 0.2500 (0.1837) Masked Language Modeling Accuracy: 0.4162 (0.3327) time: 0.6842 (0.6972) data: 0.0409 (0.0440) lr: 0.007320 max mem: 10990
2020-08-04 11:10:14,459 maskrcnn_benchmark.trainer INFO: eta: 7:36:59 iter: 400 Cross-Entropy Loss (Align Regions, Choose Caption): 0.7567 (1.2321) Cross-Entropy Loss (Align Regions, Choose Image): 0.7568 (1.2181) Cross-Entropy Loss (Align Words, Choose Caption): 0.9849 (1.3883) Cross-Entropy Loss (Align Words, Choose Image): 0.7574 (1.2242) Image Caption Matching Loss: 3.2667 (3.8996) Masked Language Modeling Loss: 3.5416 (4.3997) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 10.1175 (13.3620) Batch Accuracy (Align Regions, Choose Caption): 0.7344 (0.5463) Batch Accuracy (Align Regions, Choose Image): 0.7500 (0.5477) Batch Accuracy (Align Words, Choose Caption): 0.6875 (0.4993) Batch Accuracy (Align Words, Choose Image): 0.7500 (0.5582) Batch Accuracy (Choose Caption): 0.3281 (0.2214) Batch Accuracy (Choose Image): 0.3125 (0.2157) Masked Language Modeling Accuracy: 0.4270 (0.3564) time: 0.6761 (0.6924) data: 0.0391 (0.0427) lr: 0.008653 max mem: 10990
2020-08-04 11:11:21,972 maskrcnn_benchmark.trainer INFO: eta: 7:33:33 iter: 500 Cross-Entropy Loss (Align Regions, Choose Caption): 0.7263 (1.1427) Cross-Entropy Loss (Align Regions, Choose Image): 0.7552 (1.1284) Cross-Entropy Loss (Align Words, Choose Caption): 0.8545 (1.3019) Cross-Entropy Loss (Align Words, Choose Image): 0.6839 (1.1265) Image Caption Matching Loss: 2.5992 (3.6966) Masked Language Modeling Loss: 3.4270 (4.2172) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 9.0821 (12.6134) Batch Accuracy (Align Regions, Choose Caption): 0.7344 (0.5829) Batch Accuracy (Align Regions, Choose Image): 0.7500 (0.5862) Batch Accuracy (Align Words, Choose Caption): 0.6875 (0.5356) Batch Accuracy (Align Words, Choose Image): 0.7500 (0.5972) Batch Accuracy (Choose Caption): 0.4844 (0.2655) Batch Accuracy (Choose Image): 0.4688 (0.2580) Masked Language Modeling Accuracy: 0.4569 (0.3741) time: 0.6718 (0.6890) data: 0.0376 (0.0419) lr: 0.009987 max mem: 10990
2020-08-04 11:12:29,637 maskrcnn_benchmark.trainer INFO: eta: 7:31:03 iter: 600 Cross-Entropy Loss (Align Regions, Choose Caption): 0.8109 (1.0814) Cross-Entropy Loss (Align Regions, Choose Image): 0.7675 (1.0656) Cross-Entropy Loss (Align Words, Choose Caption): 0.8851 (1.2345) Cross-Entropy Loss (Align Words, Choose Image): 0.7401 (1.0554) Image Caption Matching Loss: 2.2591 (3.4643) Masked Language Modeling Loss: 3.2249 (4.0671) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 8.6563 (11.9681) Batch Accuracy (Align Regions, Choose Caption): 0.7500 (0.6089) Batch Accuracy (Align Regions, Choose Image): 0.7188 (0.6133) Batch Accuracy (Align Words, Choose Caption): 0.7031 (0.5637) Batch Accuracy (Align Words, Choose Image): 0.7344 (0.6255) Batch Accuracy (Choose Caption): 0.5469 (0.3110) Batch Accuracy (Choose Image): 0.5469 (0.3041) Masked Language Modeling Accuracy: 0.4715 (0.3894) time: 0.6740 (0.6869) data: 0.0381 (0.0413) lr: 0.010000 max mem: 11029
2020-08-04 11:13:37,207 maskrcnn_benchmark.trainer INFO: eta: 7:28:52 iter: 700 Cross-Entropy Loss (Align Regions, Choose Caption): 0.7495 (1.0351) Cross-Entropy Loss (Align Regions, Choose Image): 0.7199 (1.0188) Cross-Entropy Loss (Align Words, Choose Caption): 0.8646 (1.1872) Cross-Entropy Loss (Align Words, Choose Image): 0.6466 (1.0038) Image Caption Matching Loss: 1.9027 (3.2569) Masked Language Modeling Loss: 3.1395 (3.9443) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 7.9662 (11.4461) Batch Accuracy (Align Regions, Choose Caption): 0.7344 (0.6275) Batch Accuracy (Align Regions, Choose Image): 0.7500 (0.6336) Batch Accuracy (Align Words, Choose Caption): 0.7031 (0.5822) Batch Accuracy (Align Words, Choose Image): 0.7500 (0.6457) Batch Accuracy (Choose Caption): 0.6094 (0.3527) Batch Accuracy (Choose Image): 0.6094 (0.3459) Masked Language Modeling Accuracy: 0.4806 (0.4022) time: 0.6730 (0.6853) data: 0.0370 (0.0409) lr: 0.010000 max mem: 11668
2020-08-04 11:14:44,629 maskrcnn_benchmark.trainer INFO: eta: 7:26:49 iter: 800 Cross-Entropy Loss (Align Regions, Choose Caption): 0.7765 (0.9941) Cross-Entropy Loss (Align Regions, Choose Image): 0.7362 (0.9768) Cross-Entropy Loss (Align Words, Choose Caption): 0.8146 (1.1391) Cross-Entropy Loss (Align Words, Choose Image): 0.6845 (0.9586) Image Caption Matching Loss: 1.7244 (3.0742) Masked Language Modeling Loss: 2.9407 (3.8268) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 7.7543 (10.9696) Batch Accuracy (Align Regions, Choose Caption): 0.7500 (0.6444) Batch Accuracy (Align Regions, Choose Image): 0.7500 (0.6499) Batch Accuracy (Align Words, Choose Caption): 0.7031 (0.6006) Batch Accuracy (Align Words, Choose Image): 0.7969 (0.6637) Batch Accuracy (Choose Caption): 0.6719 (0.3895) Batch Accuracy (Choose Image): 0.6562 (0.3834) Masked Language Modeling Accuracy: 0.5045 (0.4146) time: 0.6723 (0.6839) data: 0.0390 (0.0405) lr: 0.010000 max mem: 11668
2020-08-04 11:15:52,211 maskrcnn_benchmark.trainer INFO: eta: 7:25:05 iter: 900 Cross-Entropy Loss (Align Regions, Choose Caption): 0.6775 (0.9608) Cross-Entropy Loss (Align Regions, Choose Image): 0.6389 (0.9432) Cross-Entropy Loss (Align Words, Choose Caption): 0.7341 (1.1026) Cross-Entropy Loss (Align Words, Choose Image): 0.6375 (0.9226) Image Caption Matching Loss: 1.6405 (2.9178) Masked Language Modeling Loss: 2.8683 (3.7332) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 7.1467 (10.5804) Batch Accuracy (Align Regions, Choose Caption): 0.7656 (0.6574) Batch Accuracy (Align Regions, Choose Image): 0.7656 (0.6635) Batch Accuracy (Align Words, Choose Caption): 0.7344 (0.6151) Batch Accuracy (Align Words, Choose Image): 0.7812 (0.6778) Batch Accuracy (Choose Caption): 0.6719 (0.4216) Batch Accuracy (Choose Image): 0.6875 (0.4158) Masked Language Modeling Accuracy: 0.5068 (0.4242) time: 0.6750 (0.6830) data: 0.0380 (0.0403) lr: 0.010000 max mem: 11668
2020-08-04 11:16:59,878 maskrcnn_benchmark.trainer INFO: eta: 7:23:32 iter: 1000 Cross-Entropy Loss (Align Regions, Choose Caption): 0.6644 (0.9350) Cross-Entropy Loss (Align Regions, Choose Image): 0.6102 (0.9167) Cross-Entropy Loss (Align Words, Choose Caption): 0.7471 (1.0709) Cross-Entropy Loss (Align Words, Choose Image): 0.6171 (0.8921) Image Caption Matching Loss: 1.5533 (2.7840) Masked Language Modeling Loss: 2.7717 (3.6484) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 7.0535 (10.2470) Batch Accuracy (Align Regions, Choose Caption): 0.7656 (0.6678) Batch Accuracy (Align Regions, Choose Image): 0.7656 (0.6745) Batch Accuracy (Align Words, Choose Caption): 0.7344 (0.6273) Batch Accuracy (Align Words, Choose Image): 0.7812 (0.6889) Batch Accuracy (Choose Caption): 0.6875 (0.4488) Batch Accuracy (Choose Image): 0.6875 (0.4439) Masked Language Modeling Accuracy: 0.5148 (0.4330) time: 0.6745 (0.6824) data: 0.0379 (0.0401) lr: 0.010000 max mem: 11668
2020-08-04 11:17:00,196 maskrcnn_benchmark.utils.checkpoint INFO: Saving checkpoint to /home/alireza/runs/vltrain/121/model_0001000.pth
This goes on for a while and finally we have:
2020-08-04 19:17:53,792 maskrcnn_benchmark.trainer INFO: eta: 0:01:14 iter: 39900 Cross-Entropy Loss (Align Regions, Choose Caption): 0.2195 (0.3917) Cross-Entropy Loss (Align Regions, Choose Image): 0.2311 (0.3784) Cross-Entropy Loss (Align Words, Choose Caption): 0.2003 (0.3690) Cross-Entropy Loss (Align Words, Choose Image): 0.1716 (0.3110) Image Caption Matching Loss: 0.2174 (0.4902) Masked Language Modeling Loss: 1.7325 (1.8332) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 2.7870 (3.7736) Batch Accuracy (Align Regions, Choose Caption): 0.9062 (0.8691) Batch Accuracy (Align Regions, Choose Image): 0.9219 (0.8714) Batch Accuracy (Align Words, Choose Caption): 0.9375 (0.8730) Batch Accuracy (Align Words, Choose Image): 0.9219 (0.8922) Batch Accuracy (Choose Caption): 0.9531 (0.9049) Batch Accuracy (Choose Image): 0.9531 (0.9052) Masked Language Modeling Accuracy: 0.6429 (0.6386) time: 0.6685 (0.7403) data: 0.0360 (0.1031) lr: 0.000100 max mem: 12152
2020-08-04 19:19:00,867 maskrcnn_benchmark.trainer INFO: eta: 0:00:00 iter: 40000 Cross-Entropy Loss (Align Regions, Choose Caption): 0.2684 (0.3914) Cross-Entropy Loss (Align Regions, Choose Image): 0.2396 (0.3780) Cross-Entropy Loss (Align Words, Choose Caption): 0.2185 (0.3686) Cross-Entropy Loss (Align Words, Choose Image): 0.1979 (0.3107) Image Caption Matching Loss: 0.2538 (0.4896) Masked Language Modeling Loss: 1.4964 (1.8325) Masked Visual Modeling Loss: 0.0000 (0.0000) loss: 2.6669 (3.7707) Batch Accuracy (Align Regions, Choose Caption): 0.8906 (0.8692) Batch Accuracy (Align Regions, Choose Image): 0.9219 (0.8715) Batch Accuracy (Align Words, Choose Caption): 0.9219 (0.8731) Batch Accuracy (Align Words, Choose Image): 0.9219 (0.8923) Batch Accuracy (Choose Caption): 0.9531 (0.9050) Batch Accuracy (Choose Image): 0.9531 (0.9053) Masked Language Modeling Accuracy: 0.6903 (0.6387) time: 0.6707 (0.7401) data: 0.0342 (0.1029) lr: 0.000100 max mem: 12152
2020-08-04 19:19:01,169 maskrcnn_benchmark.utils.checkpoint INFO: Saving checkpoint to /home/alireza/runs/vltrain/121/model_0040000.pth
2020-08-04 19:20:08,973 maskrcnn_benchmark.trainer INFO: [Validation]: eta: 0:00:00 iter: 40000 loss: 0.8663 (0.8559) Cross-Entropy Loss (Align Regions, Choose Caption): 0.1506 (0.1652) Cross-Entropy Loss (Align Regions, Choose Image): 0.1740 (0.1847) Cross-Entropy Loss (Align Words, Choose Caption): 0.1662 (0.1719) Cross-Entropy Loss (Align Words, Choose Image): 0.1187 (0.1481) Image Caption Matching Loss: 0.1837 (0.1860) Masked Language Modeling Loss: 0.0000 (0.0000) Masked Visual Modeling Loss: 0.0000 (0.0000) Batch Accuracy (Align Regions, Choose Caption): 0.9531 (0.9427) Batch Accuracy (Align Regions, Choose Image): 0.9219 (0.9335) Batch Accuracy (Align Words, Choose Caption): 0.9375 (0.9354) Batch Accuracy (Align Words, Choose Image): 0.9375 (0.9490) Batch Accuracy (Choose Caption): 0.9688 (0.9637) Batch Accuracy (Choose Image): 0.9531 (0.9603) Masked Language Modeling Accuracy: 0.0000 (0.0000) lr: 0.000100 max mem: 12152
2020-08-04 19:20:09,537 maskrcnn_benchmark.utils.checkpoint INFO: Saving checkpoint to /home/alireza/runs/vltrain/121/model_final.pth
2020-08-04 19:20:13,599 maskrcnn_benchmark.trainer INFO: Total training time: 8:14:36.107446 (0.7419 s / it)
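For anyone comparing their own log against this one, here is a minimal Python sketch that pulls the `name: value (running value)` pairs out of a trainer line so two runs can be diffed at the same iteration. It assumes the first number is a recent windowed value and the number in parentheses is the running average since the start of training, which is how the lines above appear to behave. The function names and the `tolerance` threshold are hypothetical and are not part of maskrcnn_benchmark.

```python
import re

# "<metric name>: <value> (<running value>)"; metric names never contain ':'.
METRIC_RE = re.compile(
    r"(?P<name>[A-Za-z][^:]*?): (?P<value>\d+\.\d+) \((?P<running>\d+\.\d+)\)"
)
ITER_RE = re.compile(r"\biter: (\d+)")


def parse_trainer_line(line):
    """Return (iteration, {metric: (value, running value)}), or None if no iteration is found."""
    it = ITER_RE.search(line)
    if it is None:
        return None
    metrics = {
        m.group("name"): (float(m.group("value")), float(m.group("running")))
        for m in METRIC_RE.finditer(line)
    }
    return int(it.group(1)), metrics


def diff_runs(mine, reference, tolerance=0.05):
    """List (metric, my running value, reference running value) for metrics that disagree."""
    out = []
    for name, (_, ref_running) in reference.items():
        if name in mine and abs(mine[name][1] - ref_running) > tolerance:
            out.append((name, mine[name][1], ref_running))
    return out


# The iter-40000 line from the log above, trimmed to two metrics for brevity.
sample = (
    "2020-08-04 19:19:00,867 maskrcnn_benchmark.trainer INFO: eta: 0:00:00 iter: 40000 "
    "Image Caption Matching Loss: 0.2538 (0.4896) "
    "Masked Language Modeling Accuracy: 0.6903 (0.6387) lr: 0.000100 max mem: 12152"
)
iteration, metrics = parse_trainer_line(sample)
print(iteration, metrics)
```

Running `diff_runs` on the parsed final lines of two logs would flag a run whose Masked Language Modeling Accuracy stays near zero, as in the first log linked earlier in this thread.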
It is strange that you said only seen class performance cannot be reproduced. If the problem is in the pretraining stage, it should affect seen and unseen equally. Note that AP50_split_seen and AP50_split_unseen show the generalized performance when you evaluate on coco_generalized_zeroshot_val. But they show base and target performance in non-generalized form when you evaluate on coco_not_zeroshot_val and coco_zeroshot_val, respectively. Here is the final evaluation output I get after finetuning the pretrained model on object detection:
2020-08-06 17:06:18,368 maskrcnn_benchmark.inference INFO: Start evaluation on coco_not_zeroshot_val dataset(4533 images).
2020-08-06 17:07:17,418 maskrcnn_benchmark.inference INFO: Total run time: 0:00:59.049673 (0.10421296815432488 s / img per device, on 8 devices)
2020-08-06 17:07:17,418 maskrcnn_benchmark.inference INFO: Model inference time: 0:00:53.669309 (0.09471750964188508 s / img per device, on 8 devices)
2020-08-06 17:07:19,192 maskrcnn_benchmark.inference INFO: Evaluating bbox proposals
2020-08-06 17:07:56,323 maskrcnn_benchmark.inference INFO: Preparing results for COCO format
2020-08-06 17:07:56,325 maskrcnn_benchmark.inference INFO: Preparing bbox results
2020-08-06 17:07:57,865 maskrcnn_benchmark.inference INFO: Evaluating predictions
2020-08-06 17:08:47,831 maskrcnn_benchmark.inference INFO:
Task: box_proposal
AR@100, ARs@100, ARm@100, ARl@100, AR@1000, ARs@1000, ARm@1000, ARl@1000
0.4791, 0.2997, 0.5641, 0.6889, 0.4791, 0.2997, 0.5641, 0.6889
Task: bbox
AP, AP50, AP75, APs, APm, APl, AP50_class_person, AP50_class_bicycle, AP50_class_car, AP50_class_motorcycle, AP50_class_train, AP50_class_truck, AP50_class_boat, AP50_class_bench, AP50_class_bird, AP50_class_horse, AP50_class_sheep, AP50_class_bear, AP50_class_zebra, AP50_class_giraffe, AP50_class_backpack, AP50_class_handbag, AP50_class_suitcase, AP50_class_frisbee, AP50_class_skis, AP50_class_kite, AP50_class_surfboard, AP50_class_bottle, AP50_class_fork, AP50_class_spoon, AP50_class_bowl, AP50_class_banana, AP50_class_apple, AP50_class_sandwich, AP50_class_orange, AP50_class_broccoli, AP50_class_carrot, AP50_class_pizza, AP50_class_donut, AP50_class_chair, AP50_class_bed, AP50_class_toilet, AP50_class_tv, AP50_class_laptop, AP50_class_mouse, AP50_class_remote, AP50_class_microwave, AP50_class_oven, AP50_class_toaster, AP50_class_refrigerator, AP50_class_book, AP50_class_clock, AP50_class_vase, AP50_class_toothbrush, AP50_split_seen
0.2688, 0.4676, 0.2711, 0.1165, 0.2998, 0.3715, 0.7589, 0.4438, 0.5255, 0.6232, 0.7177, 0.3771, 0.3753, 0.2853, 0.4248, 0.6627, 0.5901, 0.7871, 0.8643, 0.8554, 0.1508, 0.1205, 0.3555, 0.6725, 0.2594, 0.4994, 0.4462, 0.4545, 0.2894, 0.1086, 0.4908, 0.3398, 0.2314, 0.3959, 0.3206, 0.3828, 0.2756, 0.6398, 0.4292, 0.3236, 0.5397, 0.6920, 0.6807, 0.6620, 0.6553, 0.2390, 0.6345, 0.4350, 0.3859, 0.6281, 0.2104, 0.6410, 0.4021, 0.1616, 0.4676
2020-08-06 17:08:48,499 maskrcnn_benchmark.inference INFO: Start evaluation on coco_zeroshot_val dataset(2064 images).
2020-08-06 17:09:11,053 maskrcnn_benchmark.inference INFO: Total run time: 0:00:22.554012 (0.08741865139598995 s / img per device, on 8 devices)
2020-08-06 17:09:11,054 maskrcnn_benchmark.inference INFO: Model inference time: 0:00:20.010486 (0.07756002466807994 s / img per device, on 8 devices)
2020-08-06 17:09:11,911 maskrcnn_benchmark.inference INFO: Evaluating bbox proposals
2020-08-06 17:09:22,189 maskrcnn_benchmark.inference INFO: Preparing results for COCO format
2020-08-06 17:09:22,189 maskrcnn_benchmark.inference INFO: Preparing bbox results
2020-08-06 17:09:22,505 maskrcnn_benchmark.inference INFO: Evaluating predictions
2020-08-06 17:09:27,167 maskrcnn_benchmark.inference INFO:
Task: box_proposal
AR@100, ARs@100, ARm@100, ARl@100, AR@1000, ARs@1000, ARm@1000, ARl@1000
0.3567, 0.1648, 0.3916, 0.5186, 0.3567, 0.1648, 0.3916, 0.5186
Task: bbox
AP, AP50, AP75, APs, APm, APl, AP50_class_airplane, AP50_class_bus, AP50_class_cat, AP50_class_dog, AP50_class_cow, AP50_class_elephant, AP50_class_umbrella, AP50_class_tie, AP50_class_snowboard, AP50_class_skateboard, AP50_class_cup, AP50_class_knife, AP50_class_cake, AP50_class_couch, AP50_class_keyboard, AP50_class_sink, AP50_class_scissors, AP50_split_unseen
0.1466, 0.2746, 0.1383, 0.0828, 0.1659, 0.2026, 0.3133, 0.6267, 0.3941, 0.5513, 0.5875, 0.5904, 0.1021, 0.0014, 0.3099, 0.0903, 0.2240, 0.0546, 0.1816, 0.3637, 0.0503, 0.1190, 0.1079, 0.2746
2020-08-06 17:09:27,281 maskrcnn_benchmark.inference INFO: Start evaluation on coco_generalized_zeroshot_val dataset(4836 images).
2020-08-06 17:10:32,876 maskrcnn_benchmark.inference INFO: Total run time: 0:01:05.595098 (0.10851132800699365 s / img per device, on 8 devices)
2020-08-06 17:10:32,879 maskrcnn_benchmark.inference INFO: Model inference time: 0:00:57.917472 (0.09581054154262748 s / img per device, on 8 devices)
2020-08-06 17:10:34,701 maskrcnn_benchmark.inference INFO: Evaluating bbox proposals
2020-08-06 17:11:12,835 maskrcnn_benchmark.inference INFO: Preparing results for COCO format
2020-08-06 17:11:12,837 maskrcnn_benchmark.inference INFO: Preparing bbox results
2020-08-06 17:11:14,481 maskrcnn_benchmark.inference INFO: Evaluating predictions
2020-08-06 17:12:09,313 maskrcnn_benchmark.inference INFO:
Task: box_proposal
AR@100, ARs@100, ARm@100, ARl@100, AR@1000, ARs@1000, ARm@1000, ARl@1000
0.4700, 0.2891, 0.5485, 0.6707, 0.4700, 0.2891, 0.5485, 0.6707
Task: bbox
AP, AP50, AP75, APs, APm, APl, AP50_class_person, AP50_class_bicycle, AP50_class_car, AP50_class_motorcycle, AP50_class_airplane, AP50_class_bus, AP50_class_train, AP50_class_truck, AP50_class_boat, AP50_class_bench, AP50_class_bird, AP50_class_cat, AP50_class_dog, AP50_class_horse, AP50_class_sheep, AP50_class_cow, AP50_class_elephant, AP50_class_bear, AP50_class_zebra, AP50_class_giraffe, AP50_class_backpack, AP50_class_umbrella, AP50_class_handbag, AP50_class_tie, AP50_class_suitcase, AP50_class_frisbee, AP50_class_skis, AP50_class_snowboard, AP50_class_kite, AP50_class_skateboard, AP50_class_surfboard, AP50_class_bottle, AP50_class_cup, AP50_class_fork, AP50_class_knife, AP50_class_spoon, AP50_class_bowl, AP50_class_banana, AP50_class_apple, AP50_class_sandwich, AP50_class_orange, AP50_class_broccoli, AP50_class_carrot, AP50_class_pizza, AP50_class_donut, AP50_class_cake, AP50_class_chair, AP50_class_couch, AP50_class_bed, AP50_class_toilet, AP50_class_tv, AP50_class_laptop, AP50_class_mouse, AP50_class_remote, AP50_class_keyboard, AP50_class_microwave, AP50_class_oven, AP50_class_toaster, AP50_class_sink, AP50_class_refrigerator, AP50_class_book, AP50_class_clock, AP50_class_vase, AP50_class_scissors, AP50_class_toothbrush, AP50_split_seen, AP50_split_unseen
0.2258, 0.3993, 0.2263, 0.0955, 0.2511, 0.3154, 0.7582, 0.4409, 0.5255, 0.6233, 0.2989, 0.4933, 0.7110, 0.3726, 0.3738, 0.2838, 0.4177, 0.3214, 0.5163, 0.6437, 0.5649, 0.4667, 0.5544, 0.7802, 0.8638, 0.8565, 0.1503, 0.0799, 0.1194, 0.0024, 0.3527, 0.6709, 0.2279, 0.1133, 0.4983, 0.1333, 0.4358, 0.4520, 0.2054, 0.2715, 0.0302, 0.0969, 0.4729, 0.3383, 0.2318, 0.3924, 0.3188, 0.3817, 0.2764, 0.6373, 0.4250, 0.1394, 0.3194, 0.2536, 0.5354, 0.6923, 0.6776, 0.6532, 0.6484, 0.2376, 0.0646, 0.6274, 0.4387, 0.2619, 0.1083, 0.6259, 0.2125, 0.6381, 0.3886, 0.0946, 0.1533, 0.4599, 0.2280
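As a sanity check on how to read the split columns, the sketch below recomputes AP50_split_unseen from the coco_zeroshot_val output above. It assumes the split metric is the unweighted mean of the per-class AP50 values in that split. That assumption matches the printed numbers but is not stated anywhere in this thread, and the helper name `split_ap50` is hypothetical.

```python
# Header and value rows copied from the coco_zeroshot_val "Task: bbox" output above.
header = (
    "AP, AP50, AP75, APs, APm, APl, AP50_class_airplane, AP50_class_bus, "
    "AP50_class_cat, AP50_class_dog, AP50_class_cow, AP50_class_elephant, "
    "AP50_class_umbrella, AP50_class_tie, AP50_class_snowboard, "
    "AP50_class_skateboard, AP50_class_cup, AP50_class_knife, AP50_class_cake, "
    "AP50_class_couch, AP50_class_keyboard, AP50_class_sink, "
    "AP50_class_scissors, AP50_split_unseen"
)
values = (
    "0.1466, 0.2746, 0.1383, 0.0828, 0.1659, 0.2026, 0.3133, 0.6267, 0.3941, "
    "0.5513, 0.5875, 0.5904, 0.1021, 0.0014, 0.3099, 0.0903, 0.2240, 0.0546, "
    "0.1816, 0.3637, 0.0503, 0.1190, 0.1079, 0.2746"
)

# Pair each column name with its value.
results = dict(zip(header.split(", "), (float(v) for v in values.split(", "))))


def split_ap50(results, class_names=None):
    """Mean of per-class AP50, optionally restricted to the given class names."""
    prefix = "AP50_class_"
    per_class = [
        value
        for key, value in results.items()
        if key.startswith(prefix)
        and (class_names is None or key[len(prefix):] in class_names)
    ]
    return sum(per_class) / len(per_class)


recomputed = split_ap50(results)
print(f"recomputed: {recomputed:.4f}  reported: {results['AP50_split_unseen']:.4f}")
```

On the coco_generalized_zeroshot_val output, the same helper could be called with the 17 unseen class names as `class_names` to check AP50_split_unseen there, and with the remaining names to check AP50_split_seen.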
@alirezazareian Thanks for your reply! I found the reason: I was using the detectron2 ImageNet-pretrained model rather than the maskrcnn_benchmark ImageNet-pretrained model. Now the results are normal.
Here is the log file: https://github.com/lxtGH/ovr-cnn/blob/master/log.txt The Masked Language Modeling Accuracy is nearly zero.
How did you resolve this problem? Could you share the details?
Hi! @alirezazareian Could you help me reproduce the pretrained checkpoint from the caption pretraining stage?