fundamentalvision / Deformable-DETR

Deformable DETR: Deformable Transformers for End-to-End Object Detection.
Apache License 2.0

Fine tune on custom dataset - parts to remove #126

Open aviadlazar opened 2 years ago

aviadlazar commented 2 years ago

I'm interested in fine-tuning a model trained on COCO on custom data, but I'm unsure which layers/parts of the model I should remove to make the detection head compatible with the new dataset. Here is the list of components in r50_deformable_detr_plus_iterative_bbox_refinement_plus_plus_two_stage-checkpoint.pth:

transformer.level_embed
transformer.encoder.layers.0.self_attn.sampling_offsets.weight
transformer.encoder.layers.0.self_attn.sampling_offsets.bias
transformer.encoder.layers.0.self_attn.attention_weights.weight
transformer.encoder.layers.0.self_attn.attention_weights.bias
transformer.encoder.layers.0.self_attn.value_proj.weight
transformer.encoder.layers.0.self_attn.value_proj.bias
transformer.encoder.layers.0.self_attn.output_proj.weight
transformer.encoder.layers.0.self_attn.output_proj.bias
transformer.encoder.layers.0.norm1.weight
transformer.encoder.layers.0.norm1.bias
transformer.encoder.layers.0.linear1.weight
transformer.encoder.layers.0.linear1.bias
transformer.encoder.layers.0.linear2.weight
transformer.encoder.layers.0.linear2.bias
transformer.encoder.layers.0.norm2.weight
transformer.encoder.layers.0.norm2.bias
transformer.encoder.layers.1.self_attn.sampling_offsets.weight
transformer.encoder.layers.1.self_attn.sampling_offsets.bias
transformer.encoder.layers.1.self_attn.attention_weights.weight
transformer.encoder.layers.1.self_attn.attention_weights.bias
transformer.encoder.layers.1.self_attn.value_proj.weight
transformer.encoder.layers.1.self_attn.value_proj.bias
transformer.encoder.layers.1.self_attn.output_proj.weight
transformer.encoder.layers.1.self_attn.output_proj.bias
transformer.encoder.layers.1.norm1.weight
transformer.encoder.layers.1.norm1.bias
transformer.encoder.layers.1.linear1.weight
transformer.encoder.layers.1.linear1.bias
transformer.encoder.layers.1.linear2.weight
transformer.encoder.layers.1.linear2.bias
transformer.encoder.layers.1.norm2.weight
transformer.encoder.layers.1.norm2.bias
transformer.encoder.layers.2.self_attn.sampling_offsets.weight
transformer.encoder.layers.2.self_attn.sampling_offsets.bias
transformer.encoder.layers.2.self_attn.attention_weights.weight
transformer.encoder.layers.2.self_attn.attention_weights.bias
transformer.encoder.layers.2.self_attn.value_proj.weight
transformer.encoder.layers.2.self_attn.value_proj.bias
transformer.encoder.layers.2.self_attn.output_proj.weight
transformer.encoder.layers.2.self_attn.output_proj.bias
transformer.encoder.layers.2.norm1.weight
transformer.encoder.layers.2.norm1.bias
transformer.encoder.layers.2.linear1.weight
transformer.encoder.layers.2.linear1.bias
transformer.encoder.layers.2.linear2.weight
transformer.encoder.layers.2.linear2.bias
transformer.encoder.layers.2.norm2.weight
transformer.encoder.layers.2.norm2.bias
transformer.encoder.layers.3.self_attn.sampling_offsets.weight
transformer.encoder.layers.3.self_attn.sampling_offsets.bias
transformer.encoder.layers.3.self_attn.attention_weights.weight
transformer.encoder.layers.3.self_attn.attention_weights.bias
transformer.encoder.layers.3.self_attn.value_proj.weight
transformer.encoder.layers.3.self_attn.value_proj.bias
transformer.encoder.layers.3.self_attn.output_proj.weight
transformer.encoder.layers.3.self_attn.output_proj.bias
transformer.encoder.layers.3.norm1.weight
transformer.encoder.layers.3.norm1.bias
transformer.encoder.layers.3.linear1.weight
transformer.encoder.layers.3.linear1.bias
transformer.encoder.layers.3.linear2.weight
transformer.encoder.layers.3.linear2.bias
transformer.encoder.layers.3.norm2.weight
transformer.encoder.layers.3.norm2.bias
transformer.encoder.layers.4.self_attn.sampling_offsets.weight
transformer.encoder.layers.4.self_attn.sampling_offsets.bias
transformer.encoder.layers.4.self_attn.attention_weights.weight
transformer.encoder.layers.4.self_attn.attention_weights.bias
transformer.encoder.layers.4.self_attn.value_proj.weight
transformer.encoder.layers.4.self_attn.value_proj.bias
transformer.encoder.layers.4.self_attn.output_proj.weight
transformer.encoder.layers.4.self_attn.output_proj.bias
transformer.encoder.layers.4.norm1.weight
transformer.encoder.layers.4.norm1.bias
transformer.encoder.layers.4.linear1.weight
transformer.encoder.layers.4.linear1.bias
transformer.encoder.layers.4.linear2.weight
transformer.encoder.layers.4.linear2.bias
transformer.encoder.layers.4.norm2.weight
transformer.encoder.layers.4.norm2.bias
transformer.encoder.layers.5.self_attn.sampling_offsets.weight
transformer.encoder.layers.5.self_attn.sampling_offsets.bias
transformer.encoder.layers.5.self_attn.attention_weights.weight
transformer.encoder.layers.5.self_attn.attention_weights.bias
transformer.encoder.layers.5.self_attn.value_proj.weight
transformer.encoder.layers.5.self_attn.value_proj.bias
transformer.encoder.layers.5.self_attn.output_proj.weight
transformer.encoder.layers.5.self_attn.output_proj.bias
transformer.encoder.layers.5.norm1.weight
transformer.encoder.layers.5.norm1.bias
transformer.encoder.layers.5.linear1.weight
transformer.encoder.layers.5.linear1.bias
transformer.encoder.layers.5.linear2.weight
transformer.encoder.layers.5.linear2.bias
transformer.encoder.layers.5.norm2.weight
transformer.encoder.layers.5.norm2.bias
transformer.decoder.layers.0.cross_attn.sampling_offsets.weight
transformer.decoder.layers.0.cross_attn.sampling_offsets.bias
transformer.decoder.layers.0.cross_attn.attention_weights.weight
transformer.decoder.layers.0.cross_attn.attention_weights.bias
transformer.decoder.layers.0.cross_attn.value_proj.weight
transformer.decoder.layers.0.cross_attn.value_proj.bias
transformer.decoder.layers.0.cross_attn.output_proj.weight
transformer.decoder.layers.0.cross_attn.output_proj.bias
transformer.decoder.layers.0.norm1.weight
transformer.decoder.layers.0.norm1.bias
transformer.decoder.layers.0.self_attn.in_proj_weight
transformer.decoder.layers.0.self_attn.in_proj_bias
transformer.decoder.layers.0.self_attn.out_proj.weight
transformer.decoder.layers.0.self_attn.out_proj.bias
transformer.decoder.layers.0.norm2.weight
transformer.decoder.layers.0.norm2.bias
transformer.decoder.layers.0.linear1.weight
transformer.decoder.layers.0.linear1.bias
transformer.decoder.layers.0.linear2.weight
transformer.decoder.layers.0.linear2.bias
transformer.decoder.layers.0.norm3.weight
transformer.decoder.layers.0.norm3.bias
transformer.decoder.layers.1.cross_attn.sampling_offsets.weight
transformer.decoder.layers.1.cross_attn.sampling_offsets.bias
transformer.decoder.layers.1.cross_attn.attention_weights.weight
transformer.decoder.layers.1.cross_attn.attention_weights.bias
transformer.decoder.layers.1.cross_attn.value_proj.weight
transformer.decoder.layers.1.cross_attn.value_proj.bias
transformer.decoder.layers.1.cross_attn.output_proj.weight
transformer.decoder.layers.1.cross_attn.output_proj.bias
transformer.decoder.layers.1.norm1.weight
transformer.decoder.layers.1.norm1.bias
transformer.decoder.layers.1.self_attn.in_proj_weight
transformer.decoder.layers.1.self_attn.in_proj_bias
transformer.decoder.layers.1.self_attn.out_proj.weight
transformer.decoder.layers.1.self_attn.out_proj.bias
transformer.decoder.layers.1.norm2.weight
transformer.decoder.layers.1.norm2.bias
transformer.decoder.layers.1.linear1.weight
transformer.decoder.layers.1.linear1.bias
transformer.decoder.layers.1.linear2.weight
transformer.decoder.layers.1.linear2.bias
transformer.decoder.layers.1.norm3.weight
transformer.decoder.layers.1.norm3.bias
transformer.decoder.layers.2.cross_attn.sampling_offsets.weight
transformer.decoder.layers.2.cross_attn.sampling_offsets.bias
transformer.decoder.layers.2.cross_attn.attention_weights.weight
transformer.decoder.layers.2.cross_attn.attention_weights.bias
transformer.decoder.layers.2.cross_attn.value_proj.weight
transformer.decoder.layers.2.cross_attn.value_proj.bias
transformer.decoder.layers.2.cross_attn.output_proj.weight
transformer.decoder.layers.2.cross_attn.output_proj.bias
transformer.decoder.layers.2.norm1.weight
transformer.decoder.layers.2.norm1.bias
transformer.decoder.layers.2.self_attn.in_proj_weight
transformer.decoder.layers.2.self_attn.in_proj_bias
transformer.decoder.layers.2.self_attn.out_proj.weight
transformer.decoder.layers.2.self_attn.out_proj.bias
transformer.decoder.layers.2.norm2.weight
transformer.decoder.layers.2.norm2.bias
transformer.decoder.layers.2.linear1.weight
transformer.decoder.layers.2.linear1.bias
transformer.decoder.layers.2.linear2.weight
transformer.decoder.layers.2.linear2.bias
transformer.decoder.layers.2.norm3.weight
transformer.decoder.layers.2.norm3.bias
transformer.decoder.layers.3.cross_attn.sampling_offsets.weight
transformer.decoder.layers.3.cross_attn.sampling_offsets.bias
transformer.decoder.layers.3.cross_attn.attention_weights.weight
transformer.decoder.layers.3.cross_attn.attention_weights.bias
transformer.decoder.layers.3.cross_attn.value_proj.weight
transformer.decoder.layers.3.cross_attn.value_proj.bias
transformer.decoder.layers.3.cross_attn.output_proj.weight
transformer.decoder.layers.3.cross_attn.output_proj.bias
transformer.decoder.layers.3.norm1.weight
transformer.decoder.layers.3.norm1.bias
transformer.decoder.layers.3.self_attn.in_proj_weight
transformer.decoder.layers.3.self_attn.in_proj_bias
transformer.decoder.layers.3.self_attn.out_proj.weight
transformer.decoder.layers.3.self_attn.out_proj.bias
transformer.decoder.layers.3.norm2.weight
transformer.decoder.layers.3.norm2.bias
transformer.decoder.layers.3.linear1.weight
transformer.decoder.layers.3.linear1.bias
transformer.decoder.layers.3.linear2.weight
transformer.decoder.layers.3.linear2.bias
transformer.decoder.layers.3.norm3.weight
transformer.decoder.layers.3.norm3.bias
transformer.decoder.layers.4.cross_attn.sampling_offsets.weight
transformer.decoder.layers.4.cross_attn.sampling_offsets.bias
transformer.decoder.layers.4.cross_attn.attention_weights.weight
transformer.decoder.layers.4.cross_attn.attention_weights.bias
transformer.decoder.layers.4.cross_attn.value_proj.weight
transformer.decoder.layers.4.cross_attn.value_proj.bias
transformer.decoder.layers.4.cross_attn.output_proj.weight
transformer.decoder.layers.4.cross_attn.output_proj.bias
transformer.decoder.layers.4.norm1.weight
transformer.decoder.layers.4.norm1.bias
transformer.decoder.layers.4.self_attn.in_proj_weight
transformer.decoder.layers.4.self_attn.in_proj_bias
transformer.decoder.layers.4.self_attn.out_proj.weight
transformer.decoder.layers.4.self_attn.out_proj.bias
transformer.decoder.layers.4.norm2.weight
transformer.decoder.layers.4.norm2.bias
transformer.decoder.layers.4.linear1.weight
transformer.decoder.layers.4.linear1.bias
transformer.decoder.layers.4.linear2.weight
transformer.decoder.layers.4.linear2.bias
transformer.decoder.layers.4.norm3.weight
transformer.decoder.layers.4.norm3.bias
transformer.decoder.layers.5.cross_attn.sampling_offsets.weight
transformer.decoder.layers.5.cross_attn.sampling_offsets.bias
transformer.decoder.layers.5.cross_attn.attention_weights.weight
transformer.decoder.layers.5.cross_attn.attention_weights.bias
transformer.decoder.layers.5.cross_attn.value_proj.weight
transformer.decoder.layers.5.cross_attn.value_proj.bias
transformer.decoder.layers.5.cross_attn.output_proj.weight
transformer.decoder.layers.5.cross_attn.output_proj.bias
transformer.decoder.layers.5.norm1.weight
transformer.decoder.layers.5.norm1.bias
transformer.decoder.layers.5.self_attn.in_proj_weight
transformer.decoder.layers.5.self_attn.in_proj_bias
transformer.decoder.layers.5.self_attn.out_proj.weight
transformer.decoder.layers.5.self_attn.out_proj.bias
transformer.decoder.layers.5.norm2.weight
transformer.decoder.layers.5.norm2.bias
transformer.decoder.layers.5.linear1.weight
transformer.decoder.layers.5.linear1.bias
transformer.decoder.layers.5.linear2.weight
transformer.decoder.layers.5.linear2.bias
transformer.decoder.layers.5.norm3.weight
transformer.decoder.layers.5.norm3.bias
transformer.decoder.bbox_embed.0.layers.0.weight
transformer.decoder.bbox_embed.0.layers.0.bias
transformer.decoder.bbox_embed.0.layers.1.weight
transformer.decoder.bbox_embed.0.layers.1.bias
transformer.decoder.bbox_embed.0.layers.2.weight
transformer.decoder.bbox_embed.0.layers.2.bias
transformer.decoder.bbox_embed.1.layers.0.weight
transformer.decoder.bbox_embed.1.layers.0.bias
transformer.decoder.bbox_embed.1.layers.1.weight
transformer.decoder.bbox_embed.1.layers.1.bias
transformer.decoder.bbox_embed.1.layers.2.weight
transformer.decoder.bbox_embed.1.layers.2.bias
transformer.decoder.bbox_embed.2.layers.0.weight
transformer.decoder.bbox_embed.2.layers.0.bias
transformer.decoder.bbox_embed.2.layers.1.weight
transformer.decoder.bbox_embed.2.layers.1.bias
transformer.decoder.bbox_embed.2.layers.2.weight
transformer.decoder.bbox_embed.2.layers.2.bias
transformer.decoder.bbox_embed.3.layers.0.weight
transformer.decoder.bbox_embed.3.layers.0.bias
transformer.decoder.bbox_embed.3.layers.1.weight
transformer.decoder.bbox_embed.3.layers.1.bias
transformer.decoder.bbox_embed.3.layers.2.weight
transformer.decoder.bbox_embed.3.layers.2.bias
transformer.decoder.bbox_embed.4.layers.0.weight
transformer.decoder.bbox_embed.4.layers.0.bias
transformer.decoder.bbox_embed.4.layers.1.weight
transformer.decoder.bbox_embed.4.layers.1.bias
transformer.decoder.bbox_embed.4.layers.2.weight
transformer.decoder.bbox_embed.4.layers.2.bias
transformer.decoder.bbox_embed.5.layers.0.weight
transformer.decoder.bbox_embed.5.layers.0.bias
transformer.decoder.bbox_embed.5.layers.1.weight
transformer.decoder.bbox_embed.5.layers.1.bias
transformer.decoder.bbox_embed.5.layers.2.weight
transformer.decoder.bbox_embed.5.layers.2.bias
transformer.decoder.bbox_embed.6.layers.0.weight
transformer.decoder.bbox_embed.6.layers.0.bias
transformer.decoder.bbox_embed.6.layers.1.weight
transformer.decoder.bbox_embed.6.layers.1.bias
transformer.decoder.bbox_embed.6.layers.2.weight
transformer.decoder.bbox_embed.6.layers.2.bias
transformer.decoder.class_embed.0.weight
transformer.decoder.class_embed.0.bias
transformer.decoder.class_embed.1.weight
transformer.decoder.class_embed.1.bias
transformer.decoder.class_embed.2.weight
transformer.decoder.class_embed.2.bias
transformer.decoder.class_embed.3.weight
transformer.decoder.class_embed.3.bias
transformer.decoder.class_embed.4.weight
transformer.decoder.class_embed.4.bias
transformer.decoder.class_embed.5.weight
transformer.decoder.class_embed.5.bias
transformer.decoder.class_embed.6.weight
transformer.decoder.class_embed.6.bias
transformer.enc_output.weight
transformer.enc_output.bias
transformer.enc_output_norm.weight
transformer.enc_output_norm.bias
transformer.pos_trans.weight
transformer.pos_trans.bias
transformer.pos_trans_norm.weight
transformer.pos_trans_norm.bias
class_embed.0.weight
class_embed.0.bias
class_embed.1.weight
class_embed.1.bias
class_embed.2.weight
class_embed.2.bias
class_embed.3.weight
class_embed.3.bias
class_embed.4.weight
class_embed.4.bias
class_embed.5.weight
class_embed.5.bias
class_embed.6.weight
class_embed.6.bias
bbox_embed.0.layers.0.weight
bbox_embed.0.layers.0.bias
bbox_embed.0.layers.1.weight
bbox_embed.0.layers.1.bias
bbox_embed.0.layers.2.weight
bbox_embed.0.layers.2.bias
bbox_embed.1.layers.0.weight
bbox_embed.1.layers.0.bias
bbox_embed.1.layers.1.weight
bbox_embed.1.layers.1.bias
bbox_embed.1.layers.2.weight
bbox_embed.1.layers.2.bias
bbox_embed.2.layers.0.weight
bbox_embed.2.layers.0.bias
bbox_embed.2.layers.1.weight
bbox_embed.2.layers.1.bias
bbox_embed.2.layers.2.weight
bbox_embed.2.layers.2.bias
bbox_embed.3.layers.0.weight
bbox_embed.3.layers.0.bias
bbox_embed.3.layers.1.weight
bbox_embed.3.layers.1.bias
bbox_embed.3.layers.2.weight
bbox_embed.3.layers.2.bias
bbox_embed.4.layers.0.weight
bbox_embed.4.layers.0.bias
bbox_embed.4.layers.1.weight
bbox_embed.4.layers.1.bias
bbox_embed.4.layers.2.weight
bbox_embed.4.layers.2.bias
bbox_embed.5.layers.0.weight
bbox_embed.5.layers.0.bias
bbox_embed.5.layers.1.weight
bbox_embed.5.layers.1.bias
bbox_embed.5.layers.2.weight
bbox_embed.5.layers.2.bias
bbox_embed.6.layers.0.weight
bbox_embed.6.layers.0.bias
bbox_embed.6.layers.1.weight
bbox_embed.6.layers.1.bias
bbox_embed.6.layers.2.weight
bbox_embed.6.layers.2.bias
input_proj.0.0.weight
input_proj.0.0.bias
input_proj.0.1.weight
input_proj.0.1.bias
input_proj.1.0.weight
input_proj.1.0.bias
input_proj.1.1.weight
input_proj.1.1.bias
input_proj.2.0.weight
input_proj.2.0.bias
input_proj.2.1.weight
input_proj.2.1.bias
input_proj.3.0.weight
input_proj.3.0.bias
input_proj.3.1.weight
input_proj.3.1.bias
backbone.0.body.conv1.weight
backbone.0.body.bn1.weight
backbone.0.body.bn1.bias
backbone.0.body.bn1.running_mean
backbone.0.body.bn1.running_var
backbone.0.body.layer1.0.conv1.weight
backbone.0.body.layer1.0.bn1.weight
backbone.0.body.layer1.0.bn1.bias
backbone.0.body.layer1.0.bn1.running_mean
backbone.0.body.layer1.0.bn1.running_var
backbone.0.body.layer1.0.conv2.weight
backbone.0.body.layer1.0.bn2.weight
backbone.0.body.layer1.0.bn2.bias
backbone.0.body.layer1.0.bn2.running_mean
backbone.0.body.layer1.0.bn2.running_var
backbone.0.body.layer1.0.conv3.weight
backbone.0.body.layer1.0.bn3.weight
backbone.0.body.layer1.0.bn3.bias
backbone.0.body.layer1.0.bn3.running_mean
backbone.0.body.layer1.0.bn3.running_var
backbone.0.body.layer1.0.downsample.0.weight
backbone.0.body.layer1.0.downsample.1.weight
backbone.0.body.layer1.0.downsample.1.bias
backbone.0.body.layer1.0.downsample.1.running_mean
backbone.0.body.layer1.0.downsample.1.running_var
backbone.0.body.layer1.1.conv1.weight
backbone.0.body.layer1.1.bn1.weight
backbone.0.body.layer1.1.bn1.bias
backbone.0.body.layer1.1.bn1.running_mean
backbone.0.body.layer1.1.bn1.running_var
backbone.0.body.layer1.1.conv2.weight
backbone.0.body.layer1.1.bn2.weight
backbone.0.body.layer1.1.bn2.bias
backbone.0.body.layer1.1.bn2.running_mean
backbone.0.body.layer1.1.bn2.running_var
backbone.0.body.layer1.1.conv3.weight
backbone.0.body.layer1.1.bn3.weight
backbone.0.body.layer1.1.bn3.bias
backbone.0.body.layer1.1.bn3.running_mean
backbone.0.body.layer1.1.bn3.running_var
backbone.0.body.layer1.2.conv1.weight
backbone.0.body.layer1.2.bn1.weight
backbone.0.body.layer1.2.bn1.bias
backbone.0.body.layer1.2.bn1.running_mean
backbone.0.body.layer1.2.bn1.running_var
backbone.0.body.layer1.2.conv2.weight
backbone.0.body.layer1.2.bn2.weight
backbone.0.body.layer1.2.bn2.bias
backbone.0.body.layer1.2.bn2.running_mean
backbone.0.body.layer1.2.bn2.running_var
backbone.0.body.layer1.2.conv3.weight
backbone.0.body.layer1.2.bn3.weight
backbone.0.body.layer1.2.bn3.bias
backbone.0.body.layer1.2.bn3.running_mean
backbone.0.body.layer1.2.bn3.running_var
backbone.0.body.layer2.0.conv1.weight
backbone.0.body.layer2.0.bn1.weight
backbone.0.body.layer2.0.bn1.bias
backbone.0.body.layer2.0.bn1.running_mean
backbone.0.body.layer2.0.bn1.running_var
backbone.0.body.layer2.0.conv2.weight
backbone.0.body.layer2.0.bn2.weight
backbone.0.body.layer2.0.bn2.bias
backbone.0.body.layer2.0.bn2.running_mean
backbone.0.body.layer2.0.bn2.running_var
backbone.0.body.layer2.0.conv3.weight
backbone.0.body.layer2.0.bn3.weight
backbone.0.body.layer2.0.bn3.bias
backbone.0.body.layer2.0.bn3.running_mean
backbone.0.body.layer2.0.bn3.running_var
backbone.0.body.layer2.0.downsample.0.weight
backbone.0.body.layer2.0.downsample.1.weight
backbone.0.body.layer2.0.downsample.1.bias
backbone.0.body.layer2.0.downsample.1.running_mean
backbone.0.body.layer2.0.downsample.1.running_var
backbone.0.body.layer2.1.conv1.weight
backbone.0.body.layer2.1.bn1.weight
backbone.0.body.layer2.1.bn1.bias
backbone.0.body.layer2.1.bn1.running_mean
backbone.0.body.layer2.1.bn1.running_var
backbone.0.body.layer2.1.conv2.weight
backbone.0.body.layer2.1.bn2.weight
backbone.0.body.layer2.1.bn2.bias
backbone.0.body.layer2.1.bn2.running_mean
backbone.0.body.layer2.1.bn2.running_var
backbone.0.body.layer2.1.conv3.weight
backbone.0.body.layer2.1.bn3.weight
backbone.0.body.layer2.1.bn3.bias
backbone.0.body.layer2.1.bn3.running_mean
backbone.0.body.layer2.1.bn3.running_var
backbone.0.body.layer2.2.conv1.weight
backbone.0.body.layer2.2.bn1.weight
backbone.0.body.layer2.2.bn1.bias
backbone.0.body.layer2.2.bn1.running_mean
backbone.0.body.layer2.2.bn1.running_var
backbone.0.body.layer2.2.conv2.weight
backbone.0.body.layer2.2.bn2.weight
backbone.0.body.layer2.2.bn2.bias
backbone.0.body.layer2.2.bn2.running_mean
backbone.0.body.layer2.2.bn2.running_var
backbone.0.body.layer2.2.conv3.weight
backbone.0.body.layer2.2.bn3.weight
backbone.0.body.layer2.2.bn3.bias
backbone.0.body.layer2.2.bn3.running_mean
backbone.0.body.layer2.2.bn3.running_var
backbone.0.body.layer2.3.conv1.weight
backbone.0.body.layer2.3.bn1.weight
backbone.0.body.layer2.3.bn1.bias
backbone.0.body.layer2.3.bn1.running_mean
backbone.0.body.layer2.3.bn1.running_var
backbone.0.body.layer2.3.conv2.weight
backbone.0.body.layer2.3.bn2.weight
backbone.0.body.layer2.3.bn2.bias
backbone.0.body.layer2.3.bn2.running_mean
backbone.0.body.layer2.3.bn2.running_var
backbone.0.body.layer2.3.conv3.weight
backbone.0.body.layer2.3.bn3.weight
backbone.0.body.layer2.3.bn3.bias
backbone.0.body.layer2.3.bn3.running_mean
backbone.0.body.layer2.3.bn3.running_var
backbone.0.body.layer3.0.conv1.weight
backbone.0.body.layer3.0.bn1.weight
backbone.0.body.layer3.0.bn1.bias
backbone.0.body.layer3.0.bn1.running_mean
backbone.0.body.layer3.0.bn1.running_var
backbone.0.body.layer3.0.conv2.weight
backbone.0.body.layer3.0.bn2.weight
backbone.0.body.layer3.0.bn2.bias
backbone.0.body.layer3.0.bn2.running_mean
backbone.0.body.layer3.0.bn2.running_var
backbone.0.body.layer3.0.conv3.weight
backbone.0.body.layer3.0.bn3.weight
backbone.0.body.layer3.0.bn3.bias
backbone.0.body.layer3.0.bn3.running_mean
backbone.0.body.layer3.0.bn3.running_var
backbone.0.body.layer3.0.downsample.0.weight
backbone.0.body.layer3.0.downsample.1.weight
backbone.0.body.layer3.0.downsample.1.bias
backbone.0.body.layer3.0.downsample.1.running_mean
backbone.0.body.layer3.0.downsample.1.running_var
backbone.0.body.layer3.1.conv1.weight
backbone.0.body.layer3.1.bn1.weight
backbone.0.body.layer3.1.bn1.bias
backbone.0.body.layer3.1.bn1.running_mean
backbone.0.body.layer3.1.bn1.running_var
backbone.0.body.layer3.1.conv2.weight
backbone.0.body.layer3.1.bn2.weight
backbone.0.body.layer3.1.bn2.bias
backbone.0.body.layer3.1.bn2.running_mean
backbone.0.body.layer3.1.bn2.running_var
backbone.0.body.layer3.1.conv3.weight
backbone.0.body.layer3.1.bn3.weight
backbone.0.body.layer3.1.bn3.bias
backbone.0.body.layer3.1.bn3.running_mean
backbone.0.body.layer3.1.bn3.running_var
backbone.0.body.layer3.2.conv1.weight
backbone.0.body.layer3.2.bn1.weight
backbone.0.body.layer3.2.bn1.bias
backbone.0.body.layer3.2.bn1.running_mean
backbone.0.body.layer3.2.bn1.running_var
backbone.0.body.layer3.2.conv2.weight
backbone.0.body.layer3.2.bn2.weight
backbone.0.body.layer3.2.bn2.bias
backbone.0.body.layer3.2.bn2.running_mean
backbone.0.body.layer3.2.bn2.running_var
backbone.0.body.layer3.2.conv3.weight
backbone.0.body.layer3.2.bn3.weight
backbone.0.body.layer3.2.bn3.bias
backbone.0.body.layer3.2.bn3.running_mean
backbone.0.body.layer3.2.bn3.running_var
backbone.0.body.layer3.3.conv1.weight
backbone.0.body.layer3.3.bn1.weight
backbone.0.body.layer3.3.bn1.bias
backbone.0.body.layer3.3.bn1.running_mean
backbone.0.body.layer3.3.bn1.running_var
backbone.0.body.layer3.3.conv2.weight
backbone.0.body.layer3.3.bn2.weight
backbone.0.body.layer3.3.bn2.bias
backbone.0.body.layer3.3.bn2.running_mean
backbone.0.body.layer3.3.bn2.running_var
backbone.0.body.layer3.3.conv3.weight
backbone.0.body.layer3.3.bn3.weight
backbone.0.body.layer3.3.bn3.bias
backbone.0.body.layer3.3.bn3.running_mean
backbone.0.body.layer3.3.bn3.running_var
backbone.0.body.layer3.4.conv1.weight
backbone.0.body.layer3.4.bn1.weight
backbone.0.body.layer3.4.bn1.bias
backbone.0.body.layer3.4.bn1.running_mean
backbone.0.body.layer3.4.bn1.running_var
backbone.0.body.layer3.4.conv2.weight
backbone.0.body.layer3.4.bn2.weight
backbone.0.body.layer3.4.bn2.bias
backbone.0.body.layer3.4.bn2.running_mean
backbone.0.body.layer3.4.bn2.running_var
backbone.0.body.layer3.4.conv3.weight
backbone.0.body.layer3.4.bn3.weight
backbone.0.body.layer3.4.bn3.bias
backbone.0.body.layer3.4.bn3.running_mean
backbone.0.body.layer3.4.bn3.running_var
backbone.0.body.layer3.5.conv1.weight
backbone.0.body.layer3.5.bn1.weight
backbone.0.body.layer3.5.bn1.bias
backbone.0.body.layer3.5.bn1.running_mean
backbone.0.body.layer3.5.bn1.running_var
backbone.0.body.layer3.5.conv2.weight
backbone.0.body.layer3.5.bn2.weight
backbone.0.body.layer3.5.bn2.bias
backbone.0.body.layer3.5.bn2.running_mean
backbone.0.body.layer3.5.bn2.running_var
backbone.0.body.layer3.5.conv3.weight
backbone.0.body.layer3.5.bn3.weight
backbone.0.body.layer3.5.bn3.bias
backbone.0.body.layer3.5.bn3.running_mean
backbone.0.body.layer3.5.bn3.running_var
backbone.0.body.layer4.0.conv1.weight
backbone.0.body.layer4.0.bn1.weight
backbone.0.body.layer4.0.bn1.bias
backbone.0.body.layer4.0.bn1.running_mean
backbone.0.body.layer4.0.bn1.running_var
backbone.0.body.layer4.0.conv2.weight
backbone.0.body.layer4.0.bn2.weight
backbone.0.body.layer4.0.bn2.bias
backbone.0.body.layer4.0.bn2.running_mean
backbone.0.body.layer4.0.bn2.running_var
backbone.0.body.layer4.0.conv3.weight
backbone.0.body.layer4.0.bn3.weight
backbone.0.body.layer4.0.bn3.bias
backbone.0.body.layer4.0.bn3.running_mean
backbone.0.body.layer4.0.bn3.running_var
backbone.0.body.layer4.0.downsample.0.weight
backbone.0.body.layer4.0.downsample.1.weight
backbone.0.body.layer4.0.downsample.1.bias
backbone.0.body.layer4.0.downsample.1.running_mean
backbone.0.body.layer4.0.downsample.1.running_var
backbone.0.body.layer4.1.conv1.weight
backbone.0.body.layer4.1.bn1.weight
backbone.0.body.layer4.1.bn1.bias
backbone.0.body.layer4.1.bn1.running_mean
backbone.0.body.layer4.1.bn1.running_var
backbone.0.body.layer4.1.conv2.weight
backbone.0.body.layer4.1.bn2.weight
backbone.0.body.layer4.1.bn2.bias
backbone.0.body.layer4.1.bn2.running_mean
backbone.0.body.layer4.1.bn2.running_var
backbone.0.body.layer4.1.conv3.weight
backbone.0.body.layer4.1.bn3.weight
backbone.0.body.layer4.1.bn3.bias
backbone.0.body.layer4.1.bn3.running_mean
backbone.0.body.layer4.1.bn3.running_var
backbone.0.body.layer4.2.conv1.weight
backbone.0.body.layer4.2.bn1.weight
backbone.0.body.layer4.2.bn1.bias
backbone.0.body.layer4.2.bn1.running_mean
backbone.0.body.layer4.2.bn1.running_var
backbone.0.body.layer4.2.conv2.weight
backbone.0.body.layer4.2.bn2.weight
backbone.0.body.layer4.2.bn2.bias
backbone.0.body.layer4.2.bn2.running_mean
backbone.0.body.layer4.2.bn2.running_var
backbone.0.body.layer4.2.conv3.weight
backbone.0.body.layer4.2.bn3.weight
backbone.0.body.layer4.2.bn3.bias
backbone.0.body.layer4.2.bn3.running_mean
backbone.0.body.layer4.2.bn3.running_var
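
A listing this long is easier to reason about when grouped by component. The following is a minimal sketch, assuming the key names have already been collected into a list of strings (for example from the keys of the checkpoint's state dict). The helper name `group_by_component` and its `depth` parameter are illustrative and not part of the Deformable-DETR repository.

```python
from collections import Counter


def group_by_component(keys, depth=1):
    """Count parameter names by their first `depth` dot-separated parts."""
    counts = Counter()
    for key in keys:
        prefix = ".".join(key.split(".")[:depth])
        counts[prefix] += 1
    return counts


# A few names copied from the listing above, for illustration only.
sample_keys = [
    "transformer.level_embed",
    "transformer.decoder.class_embed.0.weight",
    "class_embed.0.weight",
    "class_embed.0.bias",
    "bbox_embed.0.layers.0.weight",
    "input_proj.0.0.weight",
    "backbone.0.body.conv1.weight",
]

# Print one line per top-level component with the number of entries in it.
for prefix, count in sorted(group_by_component(sample_keys).items()):
    print(f"{prefix}: {count}")
```

Raising `depth` to 2 or 3 separates, for instance, the `transformer.decoder.class_embed` entries from the rest of the decoder.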

Some parts seem like obvious candidates for removal, such as transformer.decoder.class_embed.0.weight (though I could still be wrong here), but I'm unsure about others, such as class_embed.0.weight.

If anyone has solved this before and is willing to share what they did, it would help anyone who wants to use Deformable DETR. Regardless, any advice would be welcome (:
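
One common approach in DETR-style fine-tuning, offered here as a sketch rather than a confirmed answer for this repository, is to drop every parameter whose shape depends on the number of classes and load the rest non-strictly, so that the new classification head is trained from scratch. The sketch assumes that only the entries with a `class_embed` component in their name (both `class_embed.*` and `transformer.decoder.class_embed.*`) depend on the class count, and that the `bbox_embed` entries can be kept. The helper name `drop_class_head` and the placeholder values are hypothetical.

```python
def drop_class_head(state_dict, marker="class_embed"):
    """Split state_dict into kept entries and removed entry names.

    An entry is removed when one of the dot-separated parts of its name
    equals `marker`. Assumption: those entries are the only ones whose
    shape depends on the number of classes.
    """
    kept = {}
    removed = []
    for name, value in state_dict.items():
        if marker in name.split("."):
            removed.append(name)
        else:
            kept[name] = value
    return kept, removed


# Placeholder strings stand in for tensors; names come from the listing above.
checkpoint_state = {
    "transformer.decoder.class_embed.0.weight": "tensor",
    "transformer.decoder.bbox_embed.0.layers.0.weight": "tensor",
    "class_embed.0.weight": "tensor",
    "class_embed.0.bias": "tensor",
    "bbox_embed.0.layers.0.weight": "tensor",
    "input_proj.0.0.weight": "tensor",
}

kept, removed = drop_class_head(checkpoint_state)
print("removed:", removed)
print("kept:", sorted(kept))
```

With PyTorch, the filtered dictionary could then be passed to `model.load_state_dict(kept, strict=False)` on a model built with the new number of classes; `strict=False` lets the missing classification weights keep their fresh initialization. Whether the checkpoint nests its weights under a key such as `model` should be checked before filtering.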

daimakram commented 2 years ago

@jackroos, @krxxxxxxxanc, could you please address this issue? Which parts of the model should we remove to accommodate fine-tuning? I would highly appreciate your response.

daimakram commented 2 years ago

Hi @aviadlazar, were you able to resolve it?

aviadlazar commented 2 years ago

@daimakram sadly I couldn't resolve it.