pipeline question / paper question

facebookresearch / ViewDiff

ViewDiff generates high-quality, multi-view consistent images of a real-world 3D object in authentic surroundings. (CVPR2024).

pipeline question / paper question #16

Closed by sararoma95 5 months ago

sararoma95 commented 5 months ago

Hello,

I’m reviewing the network proposed in your paper for rendering multiple images from one or two input images. In the pipeline diagram, the input and output images appear to be swapped between the top and bottom parts. Is this arrangement intentional, or is it an error in the diagram?

Could you clarify:

1. Whether the image arrangement in the diagram is intentional.
2. The reasoning behind this configuration, if it is intentional.

Thank you for your help and for the interesting paper.

lukasHoel commented 5 months ago

Hi, thanks for the question. It's a slight inaccuracy in the pipeline figure. We compare the predicted and sampled noise for the same image.
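
For readers following along, the comparison described above can be sketched roughly as follows. This is a minimal illustration that assumes the standard noise-prediction objective of diffusion models; it is not the ViewDiff training code. The names `predict_noise`, `alpha_bar`, and `multi_view_loss`, as well as the flat-list representation of an image, are hypothetical stand-ins.

```python
import math
import random

def add_noise(x0, eps, alpha_bar):
    """Forward diffusion step: x_t = sqrt(alpha_bar) * x0 + sqrt(1 - alpha_bar) * eps."""
    a, b = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    return [a * x + b * e for x, e in zip(x0, eps)]

def noise_loss(eps_pred, eps):
    """Mean squared error between predicted and sampled noise for one image."""
    return sum((p - e) ** 2 for p, e in zip(eps_pred, eps)) / len(eps)

def multi_view_loss(views, predict_noise, alpha_bar, rng):
    """Average the per-view loss over all views.

    `predict_noise` is a hypothetical model call that receives all noisy
    views and returns one noise prediction per view, in the same order.
    """
    # Sample independent Gaussian noise for every view.
    noises = [[rng.gauss(0.0, 1.0) for _ in x0] for x0 in views]
    noisy = [add_noise(x0, eps, alpha_bar) for x0, eps in zip(views, noises)]
    preds = predict_noise(noisy)
    # The prediction for view i is compared with the noise sampled for view i,
    # never with the noise of a different view.
    losses = [noise_loss(p, e) for p, e in zip(preds, noises)]
    return sum(losses) / len(losses)
```

The point of the sketch is the index pairing in the last step: each view's prediction is matched with that same view's sampled noise, which is what a corrected figure would show.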

sararoma95 commented 5 months ago

Thanks