Closed DerrickXuNu closed 2 years ago
You are correct in that E_inv
already contains the translation.
so $d$ will be in the coordinate system of the ego-vehicle
But we actually want the image embedding to be in the reference frame of the particular camera -
which is why we subtract c_embed
.
This is also why we subtract the same c_embed
when computing the BEV embedding.
But we actually want the image embedding to be in the reference frame of the particular camera - which is why we subtract
c_embed
. This is also why we subtract the samec_embed
when computing the BEV embedding.
Hi,I still have some doubts about this. Why only translation tk is considered and the rotation Rk is ignored?
Hey Brady,
In your encoder.py line 255, you used:
To my understanding, here you want to subtract the camera translation information from the image coordinate embedding. However, I think the translation information is already included in the image coordinate embedding:
where E_inv contains the translation already. So will the subtraction of the c_embed be redundant?