Open OrangeSodahub opened 3 months ago
I don't think current pipeline can be used to train images without depths.
@Tangshitao Sorry I don't exactly understand what you mean. Let me clarify something, I mean using depths to calculate the correspondence but don't use depths as the condition when generating images. Do you mean without depth cond. the results are not expected as with depth cond.?
Hi, I'm very interested in your work. And I want to know that if I could train your
depth
version MVDiffusion model but using SD without depth cond. (e.g. SDv1.5)? And if yes, do you have any advice on how to sample images from the entire camera trajectory to perform consistency well according to your experience. And the last question is, if the precision of depth values matter in training process? Maybe I use some depth values from bounding box to train.