Closed Uio96 closed 3 years ago
Hi,
Those steps compute the metric scale of the estimated 3D bounding box using the planes detected in the AR session data.
The 3D bounding box estimated by the neural network is only defined up to scale, but our ground truth is in metric scale. To compensate for this mismatch, we rescale the estimated 3D bounding box so that it sits on the same plane as the ground truth 3D bounding box. That is how we determine the metric scale of our estimates.
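To make the idea concrete, here is a minimal sketch of one way such a scale could be recovered. It is an illustration, not the code in `eval.py`. It assumes the box vertices are expressed in the camera frame (camera at the origin), that the plane is given as a point plus a normal pointing from the plane toward the camera side, and that the four vertices lying furthest along the negative normal form the bottom face. The function name `estimate_metric_scale` and its arguments are hypothetical.

```python
import itertools

import numpy as np


def estimate_metric_scale(vertices, plane_point, plane_normal):
    """Return the factor s such that s * (bottom vertices) lie on the plane.

    Assumptions (not taken from the Objectron code):
      - vertices: (8, 3) box corners in the camera frame, known only up to scale.
      - plane_point, plane_normal: a point on the supporting plane and its
        normal, with the normal pointing toward the camera side of the plane.
    """
    vertices = np.asarray(vertices, dtype=float)
    normal = np.asarray(plane_normal, dtype=float)
    normal = normal / np.linalg.norm(normal)

    # Signed offset of the plane from the camera origin along the normal.
    plane_offset = float(np.dot(plane_point, normal))

    # Signed offset of every vertex along the same normal.
    vertex_offsets = vertices @ normal

    # The four smallest offsets are taken to be the bottom face.
    bottom_offsets = np.sort(vertex_offsets)[:4]

    # Scaling a vertex v by s puts it on the plane when s * (n . v) == n . p.
    return float(np.mean(plane_offset / bottom_offsets))


# Usage: a metric box resting on the plane y = -1, shrunk by a factor of 0.5
# to mimic an up-to-scale prediction.
metric_box = np.array(
    list(itertools.product([-0.25, 0.25], [-1.0, -0.5], [-3.25, -2.75]))
)
predicted_box = 0.5 * metric_box

scale = estimate_metric_scale(
    predicted_box, plane_point=[0.0, -1.0, -3.0], plane_normal=[0.0, 1.0, 0.0]
)
recovered_box = scale * predicted_box
```

Averaging over the four bottom vertices, rather than using a single one, makes the estimate less sensitive to a slightly tilted predicted box.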
For more details about the ground truth and the detected planes, please refer to: https://google.github.io/mediapipe/solutions/objectron.html#obtaining-real-world-3d-training-data
Hope this helps.
Got you. Thank you so much.
Hi there,
Thanks for your great work. It is really inspiring. I am curious about the scale retrieval process and I found something in your code.
https://github.com/google-research-datasets/Objectron/blob/aa667e689848aa3619e087b493ddb3b919f9e0c8/objectron/dataset/eval.py#L203
I am a little confused about what those steps do. Could you explain them?
Thank you so much.