ygjwd12345 / TransDepth

Code for Transformers Solve Limited Receptive Field for Monocular Depth Prediction
MIT License
171 stars 20 forks source link

Why design latent kernel structure #6

Closed mzy97 closed 3 years ago

mzy97 commented 3 years ago

Thank you for sharing this wonderful work. In my opinion, the AGD module is used to select low-level features in the encoder to enhance the feature (encoder output), perform spatial and channel attention will be enough. Why design latent kernel mechanism, what is the motivation and actual improvement using latent kernel?

ygjwd12345 commented 3 years ago

Projecting a feature to latent space is often better than using the feature directly in practice. It is not our novelty, and we don't talk about it a lot.