imxtx closed this issue 7 months ago
Hi, please check the finetune code for CelebV-HQ. I think it is feasible to adapt your dataset to it.
Thank you very much. I'll look into it.
Hi authors, I tried to preprocess my custom videos the same way as the CelebV-HQ dataset, but the output video jitters because the face-cropping bounding boxes differ from frame to frame. Is it okay to use these videos for finetuning?
Hi, you can try the naive frame-level cropping first. If the finetune performance isn't good, you can try fixing the crop size to reduce the jitter.
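For later readers, here is a minimal sketch of what fixing the crop size could look like. It is not the preprocessing code from this repository. It assumes you already have one face box per frame as `(x1, y1, x2, y2)` from any detector; `stabilize_boxes` and `window` are hypothetical names used only for illustration. The idea is to use one square crop size for the whole clip and smooth the box centers with a moving average.

```python
def stabilize_boxes(boxes, window=5):
    """Turn per-frame face boxes into fixed-size square boxes with smoothed centers.

    boxes: list of (x1, y1, x2, y2) tuples, one per frame (assumed input format).
    window: number of neighbouring frames averaged for each center (hypothetical knob).
    """
    # One crop size for the whole clip: the largest side seen in any frame.
    side = max(max(x2 - x1, y2 - y1) for x1, y1, x2, y2 in boxes)

    # Per-frame box centers.
    centers = [((x1 + x2) / 2, (y1 + y2) / 2) for x1, y1, x2, y2 in boxes]

    half = window // 2
    stabilized = []
    for i in range(len(centers)):
        # Moving average over a window that is truncated at the clip ends.
        lo, hi = max(0, i - half), min(len(centers), i + half + 1)
        cx = sum(c[0] for c in centers[lo:hi]) / (hi - lo)
        cy = sum(c[1] for c in centers[lo:hi]) / (hi - lo)
        stabilized.append((cx - side / 2, cy - side / 2,
                           cx + side / 2, cy + side / 2))
    return stabilized


if __name__ == "__main__":
    raw = [(10, 10, 50, 50), (12, 10, 54, 50), (9, 11, 49, 53)]
    for box in stabilize_boxes(raw, window=3):
        print(box)
```

The returned boxes can extend past the frame border, so clamp or pad them before cropping. A larger `window` gives a steadier crop but follows fast head motion less closely.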
I want to download the pretrained model. May I ask what "Encoder MACs" means? Thank you!
MACs stands for multiply-accumulate operations. It measures the computational cost of the model.
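To make the definition concrete, here is a small sketch of how MACs are usually counted by hand for two common layer types. One MAC is one multiplication plus one addition, so it is often treated as roughly two FLOPs. The function names are hypothetical, bias terms are ignored, and this is not how the numbers reported for the pretrained models were produced.

```python
def linear_macs(in_features, out_features, tokens=1):
    """MACs of a fully connected layer applied to `tokens` input vectors.

    Each output value is a dot product over `in_features` inputs,
    which costs `in_features` multiply-accumulates.
    """
    return tokens * in_features * out_features


def conv2d_macs(c_in, c_out, kernel, h_out, w_out):
    """MACs of a 2D convolution with a square kernel (groups=1, bias ignored).

    Each output pixel of each output channel needs c_in * kernel * kernel MACs.
    """
    return c_in * c_out * kernel * kernel * h_out * w_out


if __name__ == "__main__":
    # Example sizes chosen only for illustration.
    print(linear_macs(in_features=768, out_features=3072, tokens=196))
    print(conv2d_macs(c_in=3, c_out=8, kernel=3, h_out=10, w_out=10))
```

Summing these counts over every layer of the encoder gives an estimate of the kind that an "Encoder MACs" figure describes.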
Thank you very much.
Thank you for this great work! I have a video classification task. How can I finetune the pre-trained MARLIN encoder? My videos have an average length of 5 minutes. How should I prepare the dataset, and which parts of the code should I modify?