Some questions: How is the spatial volume obtained during the inference process?

liuyuan-pal / SyncDreamer

[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

https://liuyuan-pal.github.io/SyncDreamer/

MIT License

906 stars 39 forks source link

Some questions: How is the spatial volume obtained during the inference process? #70

Open jianmanlincjx opened 1 month ago

jianmanlincjx commented 1 month ago

Thanks to the authors for the excellent work. I have some confusion: In the training code, the spatial volume is obtained from images of multiple viewpoints, whereas in the inference code, the spatial volume is obtained from multiple noise vectors. Why does the spatial volume derived from noise work during the inference process?

liuyuan-pal commented 1 week ago

Hi, we reproject all features of input views into the volume here.