How to add text to an image

Text-to-image diffusion models are bad at generating text in images because they are not trained on a dataset of images that contain text. These models are typically trained on a dataset of images that do not contain any text, so they do not have the ability to learn how to generate text in images.

In addition, text-to-image diffusion models are typically trained to generate images that are realistic and visually appealing. This means that they are focused on generating images that have the correct colors, shapes, and textures. They are not focused on generating images that contain text, so they may not be able to generate images that have text that is readable or understandable.

ai-forever / Kandinsky-2

How to add text to an image #81