Closed FrancoisPorcher closed 5 months ago
We first use eva02_L_pt_m38m_p14to16
to initialize the vision backbone. Then we train this Co-DETR model on Objects365 dataset for intermediate finetuning. Finally, we finetune this model on the LVIS dataset.
I am a bit confused, what exactly do you call intermediate fine-tuning? Is is a fine tuning of the backbone (without detector) with a MIM objective? Or is it starting with the backbone eva02_L_pt_m38m_p14to16 and initialising detector from scratch, and tracing on Object 365 Bounding boxes? Or something else?
Because from what I understand eva02_L_pt_m38m_p14to16 has already seen Obect365 in the retraining phase according to the EVA02 paper (but only the MIM objective and not the bounding box labels).
Would be great if you could clarify this, it's not easy to keep track of all the subtleties! thanks
I am sorry that my previous answer may mislead you.
The first training phase is to train the whole detector (backbone+neck+encoder+decoder+aux branches) on the Objects365 dataset. Specifically, the ViT-L backbone is initialized using eva02_L_pt_m38m_p14to16
, while the other components (neck+encoder+decoder+aux branches) are randomly initialized. The supervision signals are derived from the bounding box coordinates and labels.
Yes thank you that makes a lot of sense now! And if we want to initialize from just the eva02_L_pt_m38m_p14to16.pt
backbone, would you have any advice? The format is not a .pth
, its a .pt
, I'm just wondering which script you used to load this backbone and train the detector from scratch
We just change the init_cfg
in the backbone config. Here is an example:
backbone=dict(
type='ViT',
img_size=1536,
pretrain_img_size=512,
patch_size=16,
embed_dim=1024,
depth=24,
num_heads=16,
mlp_ratio=4*2/3,
drop_path_rate=0.3,
window_size=16,
window_block_indexes=window_block_indexes,
residual_block_indexes=residual_block_indexes,
qkv_bias=True,
use_act_checkpoint=True,
init_cfg=dict(type='Pretrained', checkpoint='models/eva02_L_pt_m38m_p14to16.pt')),
Great thanks! And last question, for LVIS Sota you sent me the config file and the checkpoint, but I dont have access to the mask head, only the box head, would you have it please?
Great thanks! And last question, for LVIS Sota you sent me the config file and the checkpoint, but I dont have access to the mask head, only the box head, would you have it please?
Sure, I will provide you with the original model. However, I'm currently engaged in several other projects, so it might take a bit of time to organize the original model and code for you. I'll get it to you as soon as possible.
Okay thanks! But it's just the weights and the config file for the mask head no?
Thanks for the help
Great thanks! And last question, for LVIS Sota you sent me the config file and the checkpoint, but I dont have access to the mask head, only the box head, would you have it please?
Sure, I will provide you with the original model. However, I'm currently engaged in several other projects, so it might take a bit of time to organize the original model and code for you. I'll get it to you as soon as possible.
Hello CoDETR team!
Im also interested in the instance seg results, would you send me the mask head config and the weights please?
Thanks a lot
Great thanks! And last question, for LVIS Sota you sent me the config file and the checkpoint, but I dont have access to the mask head, only the box head, would you have it please?
Sure, I will provide you with the original model. However, I'm currently engaged in several other projects, so it might take a bit of time to organize the original model and code for you. I'll get it to you as soon as possible.
Hello CoDETR team!
Im also interested in the instance seg results, would you send me the mask head config and the weights please?
Thanks a lot
Please email zongzhuofan@gmail.com to obtain the seg model.
Great thanks! And last question, for LVIS Sota you sent me the config file and the checkpoint, but I dont have access to the mask head, only the box head, would you have it please?
Sure, I will provide you with the original model. However, I'm currently engaged in several other projects, so it might take a bit of time to organize the original model and code for you. I'll get it to you as soon as possible.
Hello CoDETR team! Im also interested in the instance seg results, would you send me the mask head config and the weights please? Thanks a lot
Please email zongzhuofan@gmail.com to obtain the seg model.
I have send an email, looking forward to your reply :)
Hi Co-DETR team!
Could you give the exact link of the backbone of EVA-02 you used before fine tuning on LVIS please? I am assuming its one of the ones here but I am not sure: https://github.com/baaivision/EVA/tree/master/EVA-02/det
Thanks!