dengyang11 closed this issue 5 months ago
In this case, the original pred_cam[:,0] value corresponds to s, the scaling factor of the weak perspective projection, which approximates f/Z. So the depth of the human is Z = f/s. Then, we also divide by the factor bbox_size/2, so that we project the human to [-0.5,0.5].
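The conversion described above can be sketched in a few lines of NumPy. This is only an illustration under assumptions, not the repository's code: the function name is hypothetical, pred_cam is assumed to hold (s, tx, ty) per row, and bbox_size is assumed to equal the 256-pixel model input. The depth is Z = f/s divided by bbox_size/2, i.e. 2f / (bbox_size * s).

```python
import numpy as np

def weak_perspective_to_translation(pred_cam, focal_length, bbox_size=256.0, eps=1e-9):
    """Hypothetical helper: turn weak-perspective (s, tx, ty) into (tx, ty, tz)."""
    pred_cam = np.asarray(pred_cam, dtype=np.float64)
    s = pred_cam[:, 0]  # weak-perspective scale, approximates f / Z
    # Z = f / s, then divided by bbox_size / 2; eps guards against s == 0.
    tz = 2.0 * focal_length / (bbox_size * s + eps)
    # Assumption: the x/y translation is taken directly from pred_cam[:, 1:3].
    return np.stack([pred_cam[:, 1], pred_cam[:, 2], tz], axis=-1)

cam_t = weak_perspective_to_translation([[1.0, 0.1, -0.2]], focal_length=5000.0)
print(cam_t)  # third component is 2 * 5000 / 256, roughly 39.06
```

With s = 1 and f = 5000 the depth comes out at about 39.06, and a smaller s (a smaller person in the crop) gives a proportionally larger depth.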
Thanks again. In addition, why does the focal length change with image size? Thanks
You can use an arbitrary focal length value when you use the above equation. We adopt the design decisions of ProHMR. Note that self.cfg.MODEL.IMAGE_SIZE is constant (set to 256).
For the demo code, this is just a design choice to visualize the results with larger focal length values in general. You could experiment with other values too.
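To see why the focal length can be arbitrary, here is a small sketch. It makes assumptions: the helper name is hypothetical, bbox_size is 256, and the point lies exactly at the root depth, which is the weak-perspective approximation. Because the depth Z = 2f / (bbox_size * s) grows linearly with f, the focal length cancels in the pinhole projection x = f * X / Z.

```python
def project_x(X, s, focal_length, bbox_size=256.0):
    """Hypothetical helper: pinhole projection of a point at the root depth."""
    tz = 2.0 * focal_length / (bbox_size * s)  # depth recovered from the scale s
    return focal_length * X / tz               # equals X * s * bbox_size / 2

x_large_f = project_x(0.3, 0.9, focal_length=5000.0)
x_small_f = project_x(0.3, 0.9, focal_length=500.0)
print(x_large_f, x_small_f)  # both are about 34.56 (0.3 * 0.9 * 128)
```

For points that sit in front of or behind the root the cancellation is only approximate, so a larger focal length stays closer to the weak-perspective prediction.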
Thanks again
Then, we also divide by the factor bbox_size/2, so that we project the human to [-0.5,0.5].
You mean normalize to [-1, 1]? @geopavlakos
And I think it's more appropriate to normalize by bbox_size instead of the image size. It's the bbox that is resized to MODEL.IMAGE_SIZE.
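For reference, the two ranges in question differ only in the divisor. This is a purely arithmetic sketch with an assumed bbox_size of 256 and pixel offsets measured from the box center. It does not settle which convention the code uses.

```python
bbox_size = 256.0  # assumed crop size in pixels
# Pixel offsets of the left edge, center, and right edge, relative to the box center.
edges_px = [-bbox_size / 2, 0.0, bbox_size / 2]

by_full_size = [x / bbox_size for x in edges_px]        # spans [-0.5, 0.5]
by_half_size = [x / (bbox_size / 2) for x in edges_px]  # spans [-1.0, 1.0]
print(by_full_size, by_half_size)
```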
Hi, thanks for your wonderful work!
I am wondering why the pred_cam_t[3] is
I look forward to your reply, Thanks