OFA-Sys / OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Apache License 2.0
2.39k stars 248 forks source link

Poor qualitative results with finetuned text-to-image generation model #402

Open mshukor opened 1 year ago

mshukor commented 1 year ago

Hello,

I am testing the provided finetuned BASE text-to-image generation model and the quality of the generated images are very poor compared to those in the paper. I have similar (but slightly better) results with the large model. It would be great if the authors can point to the problem or provide a script with the same hyperparameters used to generate the images in the paper.

I am using the provided image_gen_example.py.

Here are some results using the query in Fig. 3 in the paper: A street scene with a double-decker bus on the road._0: image Cattle grazing on grass near a lake surrounded by mountain._0: image A brown horse in the street_0: image

Thanks in advance