OpenDriveLab / ELM

[ECCV 2024] Embodied Understanding of Driving Scenarios
144 stars 12 forks source link

Question about the usage of box data #14

Closed fang196 closed 3 months ago

fang196 commented 6 months ago

How can I get 'data/embodied/BOXQA_train_v3.json' in this line ?

What is self.temporal_length mean in lavis/datasets/datasets/elm_datasets.py ?

How can I get self.traffic_element_dict in this line?

When will you release file used in this function?

DevLinyan commented 5 months ago

Hi, thanks for your interest.

  1. BOXQA_train_v3.json is essentially the same as box_detection_train.json, accessible through this link.
  2. In the Traffic Sign Inquiry, it denotes the time needed for recognition. For instance, we aim to recognize the traffic sign within a span of 3.5 seconds. In Box Detection, it signifies the duration for temporal fusion. For instance, we use frames from the past 1 second for the box detection task.
  3. self. traffic_element_dict is acquired in this function.
  4. We have uploaded it, accessible through this link.