marek-simonik / record3d

Accompanying library for the Record3D iOS app (https://record3d.app/). Allows you to receive RGBD stream from iOS devices with TrueDepth camera(s).
https://record3d.app/
GNU Lesser General Public License v2.1
383 stars 57 forks source link

Do 'poses' in 'metadata' refer to world-to-camera transformation (extrinsics) or camera-to-world transformation? #36

Closed andrewsonga closed 2 years ago

andrewsonga commented 2 years ago

Hello,

first of all, thank you for your excellent work with record3d. It really has made extracting RGBD video from the iPad a fluid experience. The question I would like to ask to day is the following:

Do 'poses' in 'metadata' refer to world-to-camera transformation (extrinsics) or camera-to-world transformation?

I'm asking this question because different github issues related to the ARKit poses provide conflicting information. For instance, in issue #31 , t is referred to as the "world pose", which I assumes refers to the coordinates of the world origin in the camera frame. This suggests that [R | t] refers to the world-to-camera transformation (extrinsics).

However, in the same issue, you reply that X_{world} = [R|t] X_{cam}, suggesting that [R | t] actually refers to the camera-to-world transformation.

I would really appreciate it if you could resolve this confusion. Thank you.

marek-simonik commented 2 years ago

Hello,

thank you for using Record3D. The "poses" array holds camera-to-world transformations.