eladb3 / ORViT

"Object-Region Video Transformers”, Herzig et al., CVPR 2022
Apache License 2.0
42 stars 12 forks source link

How are bboxes been extracted for EK dataset? #1

Closed Randle-Github closed 2 years ago

Randle-Github commented 2 years ago

I noticed that your README file didn't mention the preparation of Epic-Kitchens dataset and didn't see the pre-trained online detector part in your code. Feel confused about the method you process on this problem. Thank you!

eladb3 commented 2 years ago

Hi, this is the folder containing the tracker code: https://github.com/eladb3/ORViT/tree/master/slowfast/utils/LinkBoxes

For Epic-Kitchens, an H5 file containing the processed boxes should be created during the dataloader generation, assuming cfg.EPICKITCHENS.VISUAL_DATA_DIR contains the boxes provided here: https://github.com/epic-kitchens/epic-kitchens-100-hand-object-bboxes

Randle-Github commented 2 years ago

Thank you!