Is only the last layer of the edited frame processed?

omerbt / TokenFlow

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

MIT License

1.58k stars 137 forks source link

Thanks for your nice work! I have two questions. The first question, the paper mentioned that each layer of the key frames has been processed. So, when editing the original video frame, is every layer also processed, or is only the last layer processed. Second question, I understand that the processing of video frames should be carried out step by step, and the result of the processing of the previous step will be output as the next step. So according to the understanding of the paper, all frames should be processed in each step, is it right?

I look forward to your reply. Thank you again.

omerbt / TokenFlow

Is only the last layer of the edited frame processed? #6