Closed MengHao666 closed 2 years ago
Could anyone here answer my question?
RootNet predicts the absolute 3D wrist location in the camera-centered coordinate system from a cropped hand image. The GT bbox is used to crop the hand image, which is why both have the same GT bboxes.
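For illustration only, here is a minimal sketch of cropping an image with a GT bbox. It is not the repository's actual code: the function name `crop_with_bbox`, the `(x, y, w, h)` bbox format, and the toy image are all assumptions made for this example.

```python
def crop_with_bbox(image, bbox):
    """Crop a row-major image (list of rows) to bbox, clamped to the image bounds.

    bbox is assumed to be (x, y, w, h) in pixels; this format is hypothetical.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    x, y, w, h = bbox
    # Clamp the box so the slice never leaves the image.
    x0, y0 = max(0, int(x)), max(0, int(y))
    x1, y1 = min(width, int(x + w)), min(height, int(y + h))
    return [row[x0:x1] for row in image[y0:y1]]


# Toy 4x6 "image" whose pixel value encodes its (row, col) position.
image = [[(r, c) for c in range(6)] for r in range(4)]
gt_bbox = (1, 1, 3, 2)  # hypothetical GT bbox

hand_crop = crop_with_bbox(image, gt_bbox)
```

Because the crop is derived directly from the GT bbox, any output that stores the bbox used for cropping will carry the same values as the GT annotation.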
Got it! Thanks for the kind reply!
I found that the bbox here is the same as the bbox here. Could you give an explanation?