question on faster r-cnn feature extraction

airsplay / lxmert

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

MIT License

923 stars 157 forks source link

Open theaitetus opened 3 years ago

theaitetus commented 3 years ago

hello thanks for sharing your great work on public.

I have a question on feature for you model.

Which model did you use for feature extraction, faster r-cnn trained on ms-coco or visual genome?

thank you in advance