google-research-datasets / Objectron

Objectron is a dataset of short, object-centric video clips. The videos also come with AR session metadata, including camera poses, sparse point clouds, and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box that describes its position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes.

Some sequence shards have missing fields #40

Open smauq opened 3 years ago

smauq commented 3 years ago

When iterating over the sequence shards, I found that some of them have missing fields. With the loading done as described in the example script, the code crashes with the following error message:

InvalidArgumentError: Name: <unknown>, Feature list 'image/encoded' is required but could not be found.  Did you mean to include it in feature_list_dense_missing_assumed_empty or feature_list_dense_defaults?
     [[{{node ParseSingleSequenceExample/ParseSequenceExample/ParseSequenceExampleV2}}]]

Iterating over the entire dataset, I found the following faulty shards (the missing fields differ between the affected shards):

objectron/sequences/book/book_train-00020-of-01324
objectron/sequences/book/book_train-00476-of-01324
objectron/sequences/book/book_train-01231-of-01324
objectron/sequences/book/book_train-01256-of-01324
objectron/sequences/bottle/bottle_train-00559-of-01320
objectron/sequences/chair/chair_train-00365-of-01274
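
A scan like the one described above could be sketched roughly as follows. This is only an illustration, not the script actually used in this report: the helper names `find_faulty_shards`, `missing_feature_lists`, and `check_shard_tf` are hypothetical, and the only required key assumed here is `image/encoded`, taken from the error message. TensorFlow is imported lazily so the generic scanning loop can be exercised without it.

```python
from typing import Callable, Dict, Iterable, List, Sequence


def find_faulty_shards(
    shard_paths: Iterable[str],
    check_shard: Callable[[str], None],
) -> Dict[str, str]:
    """Return {path: first line of the error} for every shard where check_shard raises."""
    faulty: Dict[str, str] = {}
    for path in shard_paths:
        try:
            check_shard(path)
        except Exception as err:  # keep scanning so every bad shard is reported
            message = str(err) or type(err).__name__
            faulty[path] = message.splitlines()[0]
    return faulty


def missing_feature_lists(serialized: bytes, required: Sequence[str]) -> List[str]:
    """List the required feature_list keys absent from one serialized SequenceExample."""
    import tensorflow as tf  # lazy import: only needed when real shards are read

    example = tf.train.SequenceExample.FromString(serialized)
    present = set(example.feature_lists.feature_list.keys())
    return [key for key in required if key not in present]


def check_shard_tf(path: str, required: Sequence[str] = ("image/encoded",)) -> None:
    """Raise ValueError on the first record that lacks a required feature list."""
    import tensorflow as tf

    for index, record in enumerate(tf.data.TFRecordDataset(path)):
        missing = missing_feature_lists(record.numpy(), required)
        if missing:
            raise ValueError(f"record {index} is missing feature lists: {missing}")
```

Assuming the shards are available locally, calling `find_faulty_shards(paths, check_shard_tf)` would return a dictionary keyed by shard path. Since the missing fields differ between shards, the `required` tuple would need to be extended with whichever other keys the example script parses.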

ahmadyan commented 3 years ago

I'll look into this.