daveredrum / ScanRefer

[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
https://daveredrum.github.io/ScanRefer/
Other
236 stars 28 forks source link

Camera viewpoint correspondence to the object description #6

Closed ayushjain1144 closed 3 years ago

ayushjain1144 commented 4 years ago

Hi Dave,

I was wondering if there is a linkage between scanrefer reference query and a camera viewpoint for answering the question. For example, if the query is "there is a fridge to the left of bed", then is this query attached to a particular camera viewpoint which has fridge and bed and thus the query can be grounded? Are the questions linked to the camera viewpoint in some way? Or is there a way to find that viewpoint easily?

To give you the context, our work is based on rgbds, so at test time our model expects an RGB-D image from the relevant camera viewpoint and the query which needs to be answered.

Thank you

cheng052 commented 3 years ago

Similar question! I also wonder how the viewpoint correspondence to the referring expression?

Hiusam commented 1 year ago

Same question!

daveredrum commented 1 year ago

Hi @ayushjain1144 @cheng052 @Hiusam, we've released the annotated viewpoints - Please check it out here. Have fun playing around the viewpoints! :)

Hiusam commented 1 year ago

Great! Thank you, @daveredrum.