Depth scale - Githubissues

oscarmcnulty / gta-3d-dataset

A dataset of 2D imagery, 3D point cloud data, and 3D vehicle bounding box labels all generated using the Grand Theft Auto 5 game engine.

134 stars 15 forks source link

The depth matrix you are seeing is likely in the "clip" space. To transform to the view space (regular cartesian coords relative to camera position) you need to transform it using the "projection" matrix which should also be captured from the game engine. The units for the view space coordinate system and whether you need to scale the raw depth values to [0, 1] before transforming would depend on how the projection matrix and depth matrix are captured/stored.

To transform clip space to view space for this dataset you can see the example code here

There is some detail in the thesis on page 26 about what clip space is and how the rendering pipeline works.

oscarmcnulty / gta-3d-dataset

Depth scale #5