Open asw91666 opened 1 year ago
However, Iskakov's algebraic model is trained using extrinsic parameters. If you used this pretrained model, you should not claim in Table 1 that FLEX is an extrinsic parameter-free approach.
I agree the Iskakov's algebraic model aggregates the 2D backbone predictions to a 3D voxel using the extrinsics camera parameters, but the 2D backbone is not trained using any of those parameters, so you can use that backbone as a "2D pose estimator from the shelf". We also show at our paper an ablation study using different 2D pose estimation algorithms
I read your paper with great interest, and thank you very much for sharing your code. I had a few questions after reading your paper.
1) According to your paper, you used Iskakov et al, ICCV 2019 model. Iskakov's proposed model includes an algebraic model and a volumetric model, is it correct that you used the algebraic model as a 2d pose estimator?
2) And did you use ResNet152 backbone and 384x384 image size for 2d pose estimation?
Thank you.