eladb3 / ORViT

"Object-Region Video Transformers”, Herzig et al., CVPR 2022
Apache License 2.0
42 stars 12 forks source link

Regarding the object selected for input #15

Open ziqingcheryl opened 1 year ago

ziqingcheryl commented 1 year ago

Hi, Nice to meet you! Great work! I wonder if there is a reason for selecting 4 objects per frame in EpicKitchen, where there can be clearly more than 10 objects in one frame. In this case, how did you select which 4 object information to incorporate into the model?