wzzheng / TPVFormer

[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
https://wzzheng.net/TPVFormer/
Apache License 2.0
1.2k stars 108 forks source link

the performance of your released occupancy model tpv04_occupancy #39

Open SISTMrL opened 1 year ago

SISTMrL commented 1 year ago

image hello, i run the inference of tpv04_occupancy with command, the gpu i used is v100

python eval.py --py-config config/tpv04_occupancy.py --ckpts ckpt/tpv04_occupancy_v2.pth

the performance on nuscenes is shown in the picture.

the performance is not matched with paper released and could you please explain the evaluation metrics of miou vox/pts, i'm confused about it.

huang-yh commented 1 year ago

We use different supervision signals for lidar segmentation and occupancy prediction, and thus yielding different model weights for these two tasks. For lidar segmentation, lovasz softmax loss supervises point predictions and crossentropy loss supervises voxel predictions. While for occupancy prediction, both of them supervise voxel predictions.

huang-yh commented 1 year ago

As for pts miou and vox miou, they are not two metrics, but two different ways to derive predictions for point queries. Pts miou interpolates voxel features to explicitly generate point features before feeding them to the decoder, while in vox miou, we simply assign the query point the prediction for the voxel which it belongs to.