Open melih-unsal opened 2 years ago
This looks good. The demo can generate up to 128 samples and shows you the best ones. Make sure you use a conditioning scale of 10.
I just increased it in the notebook.
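For context, a conditioning scale is commonly implemented as classifier-free guidance, where the prompt-conditioned logits are pushed away from the unconditioned ones. The sketch below assumes that formulation applies here. The function name `apply_condition_scale` and the toy logits are illustrative and not taken from the dalle-mini code.

```python
def apply_condition_scale(cond_logits, uncond_logits, scale):
    """Blend logits as uncond + scale * (cond - uncond) (assumed formulation)."""
    return [u + scale * (c - u) for c, u in zip(cond_logits, uncond_logits)]


# Toy logits for two candidate tokens (hypothetical values).
cond = [1.0, 2.0]    # logits computed with the prompt
uncond = [0.5, 0.5]  # logits computed without the prompt

unchanged = apply_condition_scale(cond, uncond, 1.0)   # scale 1 returns the conditioned logits
sharpened = apply_condition_scale(cond, uncond, 10.0)  # scale 10 widens the gap between tokens

print(unchanged, sharpened)
```

Under this assumption, a larger scale makes sampling follow the prompt more strongly, which may explain part of the gap between a low-scale notebook run and the demo.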
Thank you @borisdayma
In the notebook, could you explain what dtype=jnp.float16, _do_init=False does? In the playground repository, dtype is set to jnp.float32. Which one is better, float32 or float16?
By the way, is it normal to get around 12 seconds per image generation? If so, I wonder how many instances you are using in the demo to generate 128 samples within 30 seconds per user for mega.
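As a rough illustration of the float16 versus float32 trade-off, the sketch below uses Python's standard `struct` module rather than JAX, since both store IEEE 754 half and single precision values. The parameter count `N_PARAMS` is a hypothetical round number, not the real size of mega. Note also that in the Hugging Face Flax convention, which DalleBart appears to follow, `dtype` is documented as the computation precision, and `_do_init=False` skips random initialization and returns the loaded parameters separately (hence `model, params = ...`). Treat both points as assumptions about this library.

```python
import struct

HALF, SINGLE = "<e", "<f"  # IEEE 754 16-bit and 32-bit float formats


def roundtrip(value: float, fmt: str) -> float:
    """Store a value in the given float format and read it back."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


def weights_gib(n_params: int, bytes_per_param: int) -> float:
    """Memory needed to hold n_params values, in GiB."""
    return n_params * bytes_per_param / 2**30


bytes_per_half = struct.calcsize(HALF)      # bytes per float16 value
bytes_per_single = struct.calcsize(SINGLE)  # bytes per float32 value

N_PARAMS = 1_000_000_000  # hypothetical parameter count, for illustration only

print("float16 GiB:", weights_gib(N_PARAMS, bytes_per_half))
print("float32 GiB:", weights_gib(N_PARAMS, bytes_per_single))

# float16 keeps fewer significant bits, so stored values are coarser.
print("0.1 as float16:", roundtrip(0.1, HALF))
print("0.1 as float32:", roundtrip(0.1, SINGLE))
```

In short, float16 halves the memory per value at the cost of precision; which one gives better images has to be checked empirically.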
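The figures in this question allow a back-of-the-envelope estimate. The sketch below assumes the worst case where every image takes the full 12 seconds and nothing is batched; the helper `devices_needed` is hypothetical. Real deployments typically generate several images per device in one batch, so the actual number could be much lower.

```python
import math


def devices_needed(n_samples: int, seconds_per_image: float, time_budget_s: float) -> int:
    """Devices required if images are generated one at a time per device."""
    total_work_s = n_samples * seconds_per_image  # total sequential compute time
    return math.ceil(total_work_s / time_budget_s)


# 128 samples at 12 s each within a 30 s budget.
estimate = devices_needed(128, 12.0, 30.0)
print(estimate)
```

This is only an upper bound under the no-batching assumption and says nothing about how the demo is actually hosted.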
Hello, does the current demo still use mega-1:latest? I get very different results from the current demo.
My config is like this:
```python
DALLE_MODEL = "dalle-mini/dalle-mini/mega-1:latest"
DALLE_COMMIT_ID = None

# VQGAN model
VQGAN_REPO = "dalle-mini/vqgan_imagenet_f16_16384"
VQGAN_COMMIT_ID = "e93a26e7707683d349bf5d5c41c5b0ef69b677a9"

# Load dalle-mini
model, params = DalleBart.from_pretrained(
    DALLE_MODEL, revision=DALLE_COMMIT_ID, dtype=jnp.float32, _do_init=False
)

# Load VQGAN
vqgan, vqgan_params = VQModel.from_pretrained(
    VQGAN_REPO, revision=VQGAN_COMMIT_ID, _do_init=False
)
```
Do you see any problem with this configuration?
This is the best result I could get with the prompt "aliens travel between planets"
This is the demo result