google-research-datasets / Objectron

Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Other
2.23k stars 263 forks source link

Why swap x and y when project points from 3d to 2d #52

Open gold-mango opened 2 years ago

gold-mango commented 2 years ago

According to Objectron Geometry Tutorial , project points from 3d to 2d have two steps x = p_2d_ndc[:, 1] y = p_2d_ndc[:, 0] x comes from 1 dim, y comes from 0 dim, why swap x and y here?

ahmadyan commented 2 years ago

images are recorded in portrait mode, but the camera poses are in landscape.