alexsax / 2D-3D-Semantics

The data skeleton from Joint 2D-3D-Semantic Data for Indoor Scene Understanding
http://3dsemantics.stanford.edu
Apache License 2.0
464 stars 67 forks source link

Questions about 3D Bbox #14

Closed Jiankai-Sun closed 6 years ago

Jiankai-Sun commented 6 years ago

Hi, Stanford team,

Firstly, thank you for providing such a great dataset (http://buildingparser.stanford.edu/dataset.html)!

We are confused about the provided Bbox coordinates of object stored in the 'Area_#/3d/pointcloud.mat' file.

In each object struct (Area --> Disjoint-Space --> object):

As described, [Xmin Xmax Ymin Ymax Zmin Zmax] is the object's bounding box. However, why are the coordinates of different objects are so similar as the attached screenshot shows. It seems like there are something wrong. Is there any other operations we need to do to get the correct 3D Bbox groundtruth? Could you please provide a specific README/Instructions to guide users correctly utilize the coordinates of 3D bounding box, both in world coordinate system and camera coordinate system?

Thank you for your reply!

ir0 commented 6 years ago

Hi Jiankai,

From the screenshot you attached in the email you sent to us (attaching below for reference), I see that there is no problem with the values (in meters), however there seems to be an issue with the order of the variables. The variables should read: [Xmin Ymin Zmin Xmax Ymax Zmax]. Thanks for letting us know, we'll update the documentation. Let us know if this didn't work for you.

Best, Iro

bbox

Jiankai-Sun commented 6 years ago

Thank you for your quick reply!