instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
https://instantid.github.io/
Apache License 2.0
10.85k stars 791 forks source link

About Identity Similarity... #9

Open renderless opened 8 months ago

renderless commented 8 months ago

In technical report, Fig. 6, "Jackie Chan" does not looks like reference image especially his nose. I supposed antelopev2 model should able to extract Jackie Chan very well as his images should be in Glint360k training data. Is this the limitation of face id encoder?

Also, the picture quality seems over saturated. Is it because of SDXL base model or your prompt?

haofanwang commented 8 months ago

It should be related to the prompt or weight scale. We don't tune the parameters carefully. Anyway, talk is easy, we will show you the code soon.

wangqixun commented 8 months ago

In technical report, Fig. 6, "Jackie Chan" does not looks like reference image especially his nose. I supposed antelopev2 model should able to extract Jackie Chan very well as his images should be in Glint360k training data. Is this the limitation of face id encoder?

Also, the picture quality seems over saturated. Is it because of SDXL base model or your prompt?

It may be a problem with prompt. I changed the prompt and base model to generate a new image. At the same time, our model is constantly being optimized. 9cdf5f50b13a42dbcd7012eae0eec3129bc2af0cb06655667fc414c8

lucasjinreal commented 8 months ago

@wangqixun which model were using here?

wangqixun commented 8 months ago

@wangqixun which model were using here?

base model = https://civitai.com/models/43977?modelVersionId=227916

prompt = "cinema 4d render, high contrast, vibrant and saturated, sico style, dark and moody close-up shot of a handsome Saint-Pierrais man with a tired expression, (renaissance theme:1.1), colorful northern warrior, (glowing eyes:1.05), dynamic pose, hooded robe, surrounded by magical glow, floating ice shards, snow crystals, cold, windy background, frozen natural landscape in background cinematic atmosphere, highly detailed, sharp focus, intricate design, 3d, unreal engine, octane render, CG best quality, highres, photorealistic, dramatic lighting, artstation, concept art, cinematic, epic Steven Spielberg movie still, sharp focus, smoke, sparks, art by pascal blanche and greg rutkowski and repin, trending on artstation, hyperrealism painting, detailed character design, matte painting, 4k resolution"

neg prompt = "asian, (worst quality, low quality, thumbnail:1.4), signature, artist name, web address, cropped, jpeg artifacts, watermark, username, collage, grid, nude, topless, nsfw, naked, nipples"

The style is not very stable. Generate the image again. 044548e249d4722a80c19e0aa73333b15673915718d92a0480e13d52-2

wangqixun commented 7 months ago

@wangqixun which model were using here?

base model = https://civitai.com/models/43977?modelVersionId=227916

prompt = "cinema 4d render, high contrast, vibrant and saturated, sico style, dark and moody close-up shot of a handsome Saint-Pierrais man with a tired expression, (renaissance theme:1.1), colorful northern warrior, (glowing eyes:1.05), dynamic pose, hooded robe, surrounded by magical glow, floating ice shards, snow crystals, cold, windy background, frozen natural landscape in background cinematic atmosphere, highly detailed, sharp focus, intricate design, 3d, unreal engine, octane render, CG best quality, highres, photorealistic, dramatic lighting, artstation, concept art, cinematic, epic Steven Spielberg movie still, sharp focus, smoke, sparks, art by pascal blanche and greg rutkowski and repin, trending on artstation, hyperrealism painting, detailed character design, matte painting, 4k resolution"

neg prompt = "asian, (worst quality, low quality, thumbnail:1.4), signature, artist name, web address, cropped, jpeg artifacts, watermark, username, collage, grid, nude, topless, nsfw, naked, nipples"

The style is not very stable. Generate the image again. 044548e249d4722a80c19e0aa73333b15673915718d92a0480e13d52-2

Correct it

base model = https://civitai.com/models/84040?modelVersionId=196039