Open nityanandmathur opened 8 months ago
I just built a codebase called IGLIGEN that I use to train GLIGEN on SDv1.5/2.1. It also supports ModelScope (text-to-video generation). It is trained with SA-1B dataset which is only ~300GB after preprocessing and has 11M images. It is more "modern" (i.e., it is based on diffusers training script) and supports flash attention. The dataset, preprocessing script, and training script are included in the repo. Feel free to contact me for suggestions for this repo.
Hello! Since, GLIGEN is trained on CompVis/SD-1.4, could you please let me know how it can be trained using Stability AI's stable diffusion?