Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Other
2.24k
stars
263
forks
source link
how to get the bounding box in the world-coordinate system #60
I hope to transform the bounding box through following formula you give to the box in the world coordinate system.
rotation scale keypoints + translation
But I can't get the same result as the POINT-2D you have provided. I hope you can tell me how to get the correct result.