EdwardLeeLPZ / PowerBEV

POWERBEV, a novel and elegant vision-based end-to-end framework that only consists of 2D convolutional layers to perform perception and forecasting of multiple objects in BEVs.
Other
82 stars 18 forks source link

how to train on own dataset? #5

Closed rockywind closed 6 months ago

rockywind commented 10 months ago

Hi, thank you for sharing the great work! I want to train the model on own dataset. Can you give me some advice? My dataset is the same as kitti style.

EdwardLeeLPZ commented 8 months ago

Hi,

Since KITTI is less used in the instance prediction task, my work as well as my baseline model does not include it into the pipeline. If you want to adapt it to the KITTI dataset, you mainly need to modify the dataset part, especially the generation of segmentation and flow GT labels from the bounding boxes.

The main steps are as follows:

  1. Load the inputs for future frames, including the surrounding camera images, their corresponding intrinsic and extrinsic matrices;
  2. Load the annotations for each instance, including the category, size and position of the bounding box;
  3. Load ego vehicle positions for future frames;
  4. Generate segmentation labels from the filtered instance annotations, including semantic segmentations and instance segmentations;
  5. Generate flow labels based on the above segmentation labels;
  6. Generate center and offset labels (only used in FIERY, not in PowerBEV)

Similarly, you also need to change the configuration in other parts to the corresponding KITTI settings.