Tsingularity / dift

[NeurIPS'23] Emergent Correspondence from Image Diffusion
https://diffusionfeatures.github.io
MIT License
594 stars 32 forks source link

About the parameter ensemble_size in SDFeaturizer.forward #4

Closed lihengtao closed 1 year ago

lihengtao commented 1 year ago

Hi! Thanks for your great work. Here I don't understand why the input image is repeated for 8 times. Can ensemble_size be modified to 1?

Tsingularity commented 1 year ago

It's a good question! The forward process (Eq. 2 in the paper, i.e., adding noise to the input image) is stochastic, so adding different randomly-sampled noise and then average-pool the extracted features could further stabilize and slightly boost the performance. This operation is also mentioned in the last paragraph of Sec. 4.2. You can also set to 1 though if you find it doesn't make too much difference in your downstream task.

lihengtao commented 1 year ago

I've got that. Thanks for your reply!