Great works. A question about detailed inference settings for animatediffv2.

jiaxiangc commented 3 months ago

Hi, we are testing animatediffv2 model in your benchmark. Here are some questions.

We see you provied a lora link, but i find it is realistic checkpoint v5.1 (unet not lora weights.)
What is your inference steps. We want to reproduce a similar results for animatediffv2, but the results are so far from the benchmark results.

ziqihuangg commented 3 months ago

Hi, the detailed setting for sampling is recorded here: https://github.com/Vchitect/VBench/tree/master/sampled_videos#what-are-the-details-of-the-video-generation-models Feel free to let us know any further questions.

jiaxiangc commented 3 months ago

Here is a lora link that you provide. But i find it is a checkpoint, not a lora. https://civitai.com/models/4201?modelVersionId=130072 When you inference animatediffv2, what unet model you use?

jiaxiangc commented 3 months ago

By the way, how to calculate total score? I compute the average score, the results are 64.67 (but 80+ in huggingface). Does it weighted value?

DZY-irene commented 3 months ago

Here is a lora link that you provide. But i find it is a checkpoint, not a lora. https://civitai.com/models/4201?modelVersionId=130072 When you inference animatediffv2, what unet model you use?

Hello, we are using the configs/prompts/v2/v2-1-RealisticVision.yaml for generation. Here are our settings:


- inference_config: "configs/inference/inference-v2.yaml"
  motion_module:    "models/Motion_Module/mm_sd_v15_v2.ckpt"

  dreambooth_path: "models/DreamBooth_LoRA/realisticVisionV60B1_v51VAE.safetensors"
  lora_model_path: ""

  seed:           2 #[13100322578370451493, 14752961627088720670, 9329399085567825781, 16987697414827649302]
  steps:          25
  guidance_scale: 7.5

  prompt: # VBench_prompt
  n_prompt:
    - "semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime, text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck"

Please download the Pruned Model fp16 (1.99 GB) from https://civitai.com/models/4201?modelVersionId=130072 and place it in the corresponding directory, then set its path in dreambooth_path.

jiaxiangc commented 3 months ago

@DZY-irene @ziqihuangg Get it. Do you know how to compute total score? It is not averaged score for 16 dimension scores.

jiaxiangc commented 3 months ago

@DZY-irene @ziqihuangg Hi, what is the correct json format？ The results are abnormal. How to withdraw it？https://huggingface.co/spaces/Vchitect/VBench_Leaderboard All results are 0.

jiaxiangc commented 3 months ago

@ziqihuangg @DZY-irene Thanks for your work. Here are some question for submit.

How to fill in "source"? When I upload the .zip files, the "source" is user upload.
What is the correct json format. The following are our results in a json file. We upload a zip directory containing 16 dimension json file. But the results are 0.
How to delete upload history? The results are abnormal....

Best wishes.

yinanhe commented 3 months ago

@jiaxiangc For "source," you can fill it in the "team name." For "source," you can fill it in the "team name."

jiaxiangc commented 3 months ago

@yinanhe Thanks, I get it.

Vchitect / VBench

Great works. A question about detailed inference settings for animatediffv2. #44