yrcong / RelTR

RelTR: Relation Transformer for Scene Graph Generation: https://arxiv.org/abs/2201.11460v2
229 stars 45 forks source link

About training strategy #36

Closed Vincent-luo closed 7 months ago

Vincent-luo commented 8 months ago

Hi, I'm now training the reltr model on VG dataset and I find the training time is quite long. It takes ~2.5 days to train for 150 epochs on 4*3090 with batchsize 4. Im not sure whether I'm doing something wrong or it does need much time to train from scratch.

And I want to ask if you have tried other training strategies like multiple stage. For example, in the first stage just train the model for object detection, the in the second stage only train the triplet decoder and freeze the encoder and entity decoder(or updating with a low leaning rate). That sounds more practical and will reduce the training time in theory.

yrcong commented 8 months ago

Hi, We didn't do it since the end-to-end training is our objective. But the multiple-stage training sounds reasonable in practice.

You can even try to load the pretrained DETR.