tusen-ai / SST

Code for a series of work in LiDAR perception, including SST (CVPR 22), FSD (NeurIPS 22), FSD++ (TPAMI 23), FSDv2, and CTRL (ICCV 23, oral).
Apache License 2.0
801 stars 102 forks source link

WOD version of 3-frame waymo config #85

Open shawnding opened 1 year ago

shawnding commented 1 year ago

Hi @Abyssaledge ,

I noticed some differences in configs of 3-frame and single-frame FSD on waymo. It seems like different versions of WOD dataset is used (because the in_channels is different). I have the following questions

image

Thanks!

Abyssaledge commented 1 year ago
  1. Single-frame model and multi-frame model share the same waymo_dbinfos_train.pkl
  2. tanh_dims normalizes intensity and elongation (3rd and 4th channel). In 1f config, they are also normalized by default.
  3. 3f model has much more points, so we reduce the number of channels to 32.
shawnding commented 1 year ago

Thanks! A following question, does the 6th channel represents range frame offset (according to this)? Does it make a difference to the detection performance?

Abyssaledge commented 1 year ago

The 6th channel is the relative timestamp, implemented in LoadPointsFromMultiSweepsWaymo.