Closed achen46 closed 1 year ago
-G denotes the sampling with classifier-free guidance. It's not a new model but a sampling scheme.
@gasvn Thanks for the response. May I ask what is the classifier-free guidance value you used ?
@gasvn Also I wonder, why you choose to use a total batch size of 256 ? it seems to be more efficient to use a larger batch size like 512 but are there any reasons behind this decision ?
@gasvn Thanks for the response. May I ask what is the classifier-free guidance value you used ?
The sampling parameters are in https://github.com/sail-sg/MDT/blob/main/scripts/image_sample.py
You can use the script in https://github.com/sail-sg/MDT#evaluation to evalute the model.
@gasvn Also I wonder, why you choose to use a total batch size of 256 ? it seems to be more efficient to use a larger batch size like 512 but are there any reasons behind this decision ?
Your are right. But to make fair comparisons with DiT, we also choose the batch size of 256.
@gasvn Thanks for all the prompt answers.
Thanks for great work. In the paper, best FID results are for MDT-G. But I cannot find the config in here.
Can you let us know the exact config for MDT-G ?