aim-uofa / GenPercept

GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
https://huggingface.co/spaces/guangkaixu/GenPercept
Creative Commons Zero v1.0 Universal
91 stars 2 forks source link

Question about GenPercept performance #5

Open gwang-kim opened 4 weeks ago

gwang-kim commented 4 weeks ago

Hi there,

Thank you for your great work on GenPercept! The idea of one-step estimation is very promising.

I noticed that the performance of GenPercept seems to be lower than Marigold and GeoWizard. I'm wondering if this difference is due to:

Any insights you could provide would be greatly appreciated!

guangkaixu commented 1 week ago

@gwang-kim Hi, sorry for the late response. It seems that my e-mail didn't notice me this issue in GitHub.

Although we tried to reproduce Marigold by ourselves (the training code of Marigold was not released) , we can hardly reach the performance in their paper. In order to compare fairly, we compare Marigold and GenPercept under the same training setting, and the results can be seen in Table 7 of the main paper. image

"Stochastic MS Gen." here actually means Marigold trained by ourselves, and the performance is slightly worse than the training paradigm GenPercept in the depth estimation task.