When I evaluated the authors' provided ILD_Coeff model, I did not get the F1 score of 0.787 reported in table 4 of the paper. Instead I got F1 score of 0.8056.

This is the exact output of the eval:

precision 0.7053766470275624, recall 0.5420338264165887, fscore 0.6130106817959742

culane laneatt metric evaluation start!

Loading prediction data... 100%|████████████████████████████████████████████████████████████████████████| 23021/23021 [00:08<00:00, 2827.61it/s] Loading annotation data... 100%|████████████████████████████████████████████████████████████████████████| 23021/23021 [00:09<00:00, 2302.34it/s] Calculating metric in parallel... 100%|█████████████████████████████████████████████████████████████████████████| 23021/23021 [02:50<00:00, 134.98it/s] ====================Results (datalist.txt)==================== TP: 62200 FP: 13849 FN: 16172 Precision: 0.8179 Recall: 0.7937 F1: 0.8056

culane laneatt metric evaluation done!

But I do get the authors' reported F1 score when I evaluate their provided PLD model, with an F1 score of 0.8249 which matches the paper's number in table 4 of 0.825.

This is the exact output of the eval for PLD: precision 0.6302282598102822, recall 0.5900906007422002, fscore 0.6094992939678525

culane laneatt metric evaluation start!

Loading prediction data... 100%|█████████████████████████████████████████| 23021/23021 [00:07<00:00, 2891.16it/s] Loading annotation data... 100%|█████████████████████████████████████████| 23021/23021 [00:10<00:00, 2285.51it/s] Calculating metric in parallel... 100%|██████████████████████████████████████████| 23021/23021 [02:47<00:00, 137.51it/s] miou ===> 0.7256893668357335 ====================Results (datalist.txt)==================== TP: 64516 FP: 13541 FN: 13856 Precision: 0.8265 Recall: 0.8232 F1: 0.8249

culane laneatt metric evaluation done!

I am wondering if this is expected or if I did something wrong? Because this is not as much of an improvement from single-frame to video-based detection as is reported in the paper.

Thanks.

dongkwonjin / RVLD

Authors ILD_Coeff model does not give paper's results for F1 score #13