Open unnamed333user opened 1 month ago
Our work starts from SVD (Stable Video Diffusion). We assume that SVD has already seen a large volume of multi-view images during its training. Building on this, we further trained our model on the multi-view dataset from Objaverse to fully leverage SVD's capability for generating multi-view images. For more details, please refer to our paper.
Thanks for sharing such great work! I have a question: how can the model generate the back side of an object if the input is only a front-view image of it?
I think the back views are either random but natural-looking generations from Stable Video Diffusion, or the result of overfitting to Objaverse. I have seen many works built on the Objaverse dataset, and none of their results look that realistic.