nianticlabs / mickey

[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
https://nianticlabs.github.io/mickey/
Other
498 stars 31 forks source link

How to perform image pair matching #4

Closed pbmsmtmt closed 6 months ago

pbmsmtmt commented 7 months ago

Thank you for your work. I would like to ask if the input is an image pair, one as a reference image and the other as a source image, can the source image be converted to the perspective of the reference image through the Mickey model? Similar to solving the transformation matrix between the two? Please tell me how to implement it in the code, because the demo seems to be more inclined to output the depth map and the confidence score map, but it does not give me the conversion result. Looking forward to your reply!

axelBarroso commented 7 months ago

Hello!

MicKey uses one single image as input, and as you mentioned, it returns:

We provide the demo code to compute the metric relative pose (R, t), given two images: the reference and the query image.

The relative pose can be used to warp one image into the other. You would need to use the depth maps for that too. Note that depth maps are not accurate in the whole image, but only where the keypoint score is high.

axelBarroso commented 6 months ago

Closing this issue since it has been inactive for over a week - do please open it again if you have more questions. Thanks!