duanyiqun / DiffusionDepth

PyTorch Implementation of introducing diffusion approach to 3D depth perception ECCV 2024
https://arxiv.org/abs/2303.05021
Apache License 2.0
306 stars 17 forks source link

Training encoder decoder #3

Closed dsshim0125 closed 1 year ago

dsshim0125 commented 1 year ago

Hi, thank you for your brilliant works!

I have a question in training the encoder and decoder for mapping the depth to the latent space.

Are they trained beforehand and froze during training denoising process similar to latent diffusion or all the networks (encoder, decoder, and denoising function) are trained all together in an end-to-end manner?

duanyiqun commented 1 year ago

Hi there,

I did have tested the pre-train way and train-in-end2end way. Limited to our experiments, these two methods do not suggest a clear difference.

In this code, all the networks (encoder, decoder, and denoising function) are trained all together in an end-to-end manner.

dsshim0125 commented 1 year ago

Thank you for your fast responses!