bytedance / MVDream

Multi-view Diffusion for 3D Generation
MIT License
780 stars 57 forks source link

Curious about the guidance scale used in your practice #6

Closed lizhiqi49 closed 11 months ago

lizhiqi49 commented 1 year ago

Hi I have a question about the guidance scale.

I have tested the 2.1 model in my machine, and I find that the generation quality is very bad when guidance scale is relatively small (<20), and the quality was getting better and better as I added it along to 60. Below shows some of my testing results.

guidance scale = 10: image

guidance scale = 15: image

guidance scale = 20: image

guidance scale = 40: image

guidance scale = 60: image

Since I notice that in your T2I script you set the guidance scale defaultly as 10, I'm wondering if I made anything wrong.

And the backgrounds in the images above are all same brown color, is it correct?

lizhiqi49 commented 1 year ago

And the sampler I used is 50-steps DDIM, with the prompt as "an astronaut riding a horse, 3d asset", and unconditional prompt as null ("").

seasonSH commented 1 year ago

which script are you using and did you change anything?

lizhiqi49 commented 12 months ago

Actually its on my custom diffusers code since I'm more familiar with diffusers, I converted your ldm checkpoint to diffusers version... I think it unlikely generates such reasonable results if there is something wrong in my code.

But if the issue I met is unmatched with your practice, I will go to try your official code and check mine.

lizhiqi49 commented 12 months ago

I've tried your official code and everything goes well, so there must be something wrong in my code, I'll check it. hhh