tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Apache License 2.0
4.46k stars 289 forks source link

how much time will convergence #369

Open xiaowen0110 opened 1 month ago

xiaowen0110 commented 1 month ago

I'm trying use ip-adapter to pixart-sigma. But it seems like not convergence. (lr=1e-5, batch_size=10*8). after 4k steps, it does not convergence, and the generated image is collapsed at any training time. I noticed that you can get a good result when training step is 200k steps. What does your results looked like in the begin of training . image (ps: the smaller lr can convergence more ( stop at about 8k steps), but the result is still collapsed too.) image