Closed fabienbaradel closed 1 year ago
@fabienbaradel During evaluation, we directly match the ground truth with the most close prediction for evaluation. So I think this refers to all people. As we all know that 3DPW only provide the ground truth for 1 or 2 people in the scene. In many scenes, it lacks detection annotations for all people in the scene, while ROMP and BEV try to predict all people in the scene. So, you know.
Thanks for your interests in our work! We really appreciate it. Best, Yu
Hi @Arthur151 , Thanks for your amazing work. In the supp. mat. of the BEV paper you are reporting result of ROMP and BEV on 3DPW test set (Table 5). But since your method is solving at the same time the detection and the regression I am wondering what kind of detection rate do you get? I am not able to find this information in your paper. And for Table 5 (cf attached) do you compute the metrics on the matched person or on all the persons? Thanks for you help,