fnzhan / Generative-AI

[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era
784 stars 60 forks source link

Research for consideration. #1

Closed afiaka87 closed 2 years ago

afiaka87 commented 2 years ago

Katherine Crowson, v-objective-diffusion. https://github.com/crowsonkb/v-diffusion-pytorch

High-Resolution Image Synthesis with Latent Diffusion Models Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer * denotes equal contribution https://github.com/CompVis/latent-diffusion https://arxiv.org/abs/2112.10752

ruDALL-E https://github.com/sberbank-ai/ru-dalle https://rudalle.ru/en/

Russian Diffusion On Language Picture Hyper-modality RuDOLPH A smaller ruDALL-E that makes use of parameter sharing. (Very new, can't find a paper) https://github.com/sberbank-ai/ru-dolph

Paper for Emojich, which I believe is the associated research with ruDALL-E. https://arxiv.org/pdf/2112.02448.pdf

fnzhan commented 2 years ago

Thanks for your suggestion! I will add these impressive works.