Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Other
2.24k
stars
263
forks
source link
Notebook for parsing Objectron's geometry files. #9
Adds a notebook for showing how to parse Objectron's geometry.pbdata, Tutorial on how to project 3D points (box or scene point-cloud) on the image.