dongkwonjin / RVLD

Official code for "Recursive Video Lane Detection", ICCV 2023
Apache License 2.0
71 stars 7 forks source link

Authors ILD_Coeff model does not give paper's results for F1 score #13

Open aj96 opened 4 months ago

aj96 commented 4 months ago

When I evaluated the authors' provided ILD_Coeff model, I did not get the F1 score of 0.787 reported in table 4 of the paper. Instead I got F1 score of 0.8056.

This is the exact output of the eval:

precision 0.7053766470275624, recall 0.5420338264165887, fscore 0.6130106817959742

culane laneatt metric evaluation start!

Loading prediction data... 100%|████████████████████████████████████████████████████████████████████████| 23021/23021 [00:08<00:00, 2827.61it/s] Loading annotation data... 100%|████████████████████████████████████████████████████████████████████████| 23021/23021 [00:09<00:00, 2302.34it/s] Calculating metric in parallel... 100%|█████████████████████████████████████████████████████████████████████████| 23021/23021 [02:50<00:00, 134.98it/s] ====================Results (datalist.txt)==================== TP: 62200 FP: 13849 FN: 16172 Precision: 0.8179 Recall: 0.7937 F1: 0.8056

culane laneatt metric evaluation done!

But I do get the authors' reported F1 score when I evaluate their provided PLD model, with an F1 score of 0.8249 which matches the paper's number in table 4 of 0.825.

This is the exact output of the eval for PLD: precision 0.6302282598102822, recall 0.5900906007422002, fscore 0.6094992939678525

culane laneatt metric evaluation start!

Loading prediction data... 100%|█████████████████████████████████████████| 23021/23021 [00:07<00:00, 2891.16it/s] Loading annotation data... 100%|█████████████████████████████████████████| 23021/23021 [00:10<00:00, 2285.51it/s] Calculating metric in parallel... 100%|██████████████████████████████████████████| 23021/23021 [02:47<00:00, 137.51it/s] miou ===> 0.7256893668357335 ====================Results (datalist.txt)==================== TP: 64516 FP: 13541 FN: 13856 Precision: 0.8265 Recall: 0.8232 F1: 0.8249

culane laneatt metric evaluation done!

I am wondering if this is expected or if I did something wrong? Because this is not as much of an improvement from single-frame to video-based detection as is reported in the paper.

Thanks.

dongkwonjin commented 4 months ago

As you pointed out, ILD may yield better performances in the image metrics, compared to PLD.

However, in terms of video metrics, such as flickering and missing rates, ILD yields inferior results than PLD.

In other words, PLD provides temporally more stable detection results of lanes than ILD.

Thank you for your attention.