Open songxueXS opened 2 years ago
More precisely, this is the code line from the Improved_VQ-Diffusion branch. The cf_predict_start function is not defined in the DiffusionTransformer class. Is it the same as the one from the dalle class? https://github.com/cientgu/VQ-Diffusion/blob/fe79083818b47d4d376ab9579ec19cba2a43c3cb/image_synthesis/modeling/models/dalle.py#L170
More precisely, this is the code line from the Improved_VQ-Diffusion branch. The cf_predict_start function is not defined in the DiffusionTransformer class. Is it the same as the one from the dalle class?
https://github.com/cientgu/VQ-Diffusion/blob/fe79083818b47d4d376ab9579ec19cba2a43c3cb/image_synthesis/modeling/models/dalle.py#L170 Thank you very much for your reply, but I find they are not exactly the same and 'def cf_predict_start(log_x_t, cond_emb, t)' depends on many parameters. Can you give a full version?
Sorry for late reply. The cf_predict_start function is defined in generate_content in dalle.py for classifier-free sampling, but while training we don't need classifier-free sampling and it is not executed. By adding a placeholder in diffusion_transformer.py it should be solved:
def cf_predict_start(self, log_x_t, cond_emb, t):
return self.predict_start(log_x_t, cond_emb, t)
Hi author, thanks for sharing your inspiring work! I'm trying to make re-implementation of Improved VQ-Diffusion. I have the problem.