Open Angus-Qi opened 10 months ago
I have the same question. Hoping for a reply from the authors. Thanks! @paulpanwang @zhiwenfan
I think I have figured out what these two txt files mean. The 3x4 matrix in prompt.txt holds the rotation matrix (3x3) and translation vector (3x1) of the object relative to the camera at the time prompt.png was captured. Likewise, target.txt holds the rotation matrix and translation vector of the object relative to the camera for target.png. @Angus-Qi
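If that reading is right, each file can be split into its rotation and translation parts as in the sketch below. This is only an illustration, not code from the repo: it assumes the file is plain whitespace-separated text with 12 numbers laid out as [R | t], and the helper name `split_pose` and the example values are made up for the demonstration.

```python
import io
import numpy as np

def split_pose(source):
    """Split a 3x4 [R | t] pose matrix into rotation R (3x3) and translation t (3,).

    Assumption: `source` is a path or file-like object holding 12
    whitespace-separated numbers, row by row.
    """
    pose = np.loadtxt(source).reshape(3, 4)
    return pose[:, :3], pose[:, 3]

# Hypothetical contents standing in for prompt.txt (identity rotation).
example = io.StringIO("1 0 0 0.1\n0 1 0 0.2\n0 0 1 1.5\n")
R, t = split_pose(example)

# Sanity check: a valid rotation is orthonormal with determinant +1.
is_rotation = np.allclose(R @ R.T, np.eye(3), atol=1e-6) and np.isclose(np.linalg.det(R), 1.0)

# Under the object-to-camera reading, an object-frame point maps as x_cam = R @ x_obj + t,
# so the object origin lands at t in the camera frame.
x_obj = np.zeros(3)
x_cam = R @ x_obj + t
```

Running the same orthonormality check on the real prompt.txt and target.txt would be a quick way to confirm that the left 3x3 block really is a rotation.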
@ddz16, thank you for your response. Where can we find a definition of this format? And if the pose of the object in the target image is already known, what is being calculated?
Thanks for the contribution and for releasing the code for this project. The work is really interesting.
Regarding the visual_3dbbox.py demo, could you explain what prompt.txt and target.txt are? I look forward to testing on other prompt and target images.