cshizhe / VLN-DUET

Official implementation of Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation (CVPR'22 Oral).

How to extract object features #15

Open zhangjb416 opened 1 year ago

zhangjb416 commented 1 year ago

Hi, could you please provide the object feature extraction code for the REVERIE and SOON datasets?

It is mentioned in the paper that "For objects, we use the same ViT on the REVERIE dataset as it provides bounding boxes, while we use the BUTD object detector on the SOON dataset." Did you crop each object from the panorama and extract its features with the ViT?
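
To make the question concrete, the crop-then-encode procedure being asked about could be sketched roughly as follows. This is only an illustration under assumptions, not the authors' code: the `(x, y, w, h)` pixel box format, the nested-list image, and the names `crop_box`, `extract_object_features`, and `mean_encoder` are all hypothetical, and `mean_encoder` is a placeholder standing in for a real ViT forward pass.

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[float]]        # H x W grid, a stand-in for one view image
Box = Tuple[int, int, int, int]  # (x, y, w, h) in pixels; this format is an assumption


def crop_box(image: Image, box: Box) -> Image:
    """Crop one bounding box from the image, clamping it to the image bounds."""
    height, width = len(image), len(image[0])
    x, y, w, h = box
    x0, y0 = max(0, x), max(0, y)
    x1, y1 = min(width, x + w), min(height, y + h)
    if x1 <= x0 or y1 <= y0:
        raise ValueError(f"box {box} lies outside the image")
    return [row[x0:x1] for row in image[y0:y1]]


def extract_object_features(
    image: Image,
    boxes: Sequence[Box],
    encode: Callable[[Image], List[float]],
) -> List[List[float]]:
    """Return one feature vector per box by encoding each cropped region."""
    return [encode(crop_box(image, box)) for box in boxes]


def mean_encoder(crop: Image) -> List[float]:
    """Placeholder encoder (NOT a ViT): a 1-d feature holding the mean pixel value."""
    values = [v for row in crop for v in row]
    return [sum(values) / len(values)]


# Toy 4x4 image whose pixel at (row r, column c) has value 4*r + c.
image = [[float(4 * r + c) for c in range(4)] for r in range(4)]
boxes = [(1, 1, 2, 2), (2, 2, 5, 5)]  # the second box is clamped to the image
features = extract_object_features(image, boxes, mean_encoder)
print(features)
```

In a real pipeline, `encode` would resize and normalize each crop and run it through the same ViT used for the view images; whether the paper's features were produced this way is exactly what this issue is asking.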