hitachinsk / FGT

[ECCV 2022] Flow-Guided Transformer for Video Inpainting
https://hitachinsk.github.io/publication/2022-10-01-Flow-Guided-Transformer-for-Video-Inpainting
MIT License
300 stars, 31 forks

Can't be cleaned up, is it an issue with the pre-trained model? #43

Closed: wxllyf closed this issue 11 months ago

wxllyf commented 11 months ago

FGT: [image: WeChat screenshot 20231007121612]

Other: [image: WeChat screenshot 20231007121405]

What steps have I missed? To shorten the training time, can I train on only the frame sequence of a single video?

hitachinsk commented 11 months ago

To use the FGT model, you must provide a mask that marks the regions you want to remove; otherwise the model cannot know which regions to remove. Moreover, FGT cannot be trained on a single video, because the model needs more data to achieve good results.

wxllyf commented 11 months ago

Both screenshots use the same frames and masks, so the result seems strange. I learned about FGT from a recommendation in the comments of another project. Below is the original image. [image: WeChat screenshot 20231007163306]

The command I used is:

python video_inpainting.py --path images --path_mask masks --outroot test --imgH 512 --imgW 288

hitachinsk commented 11 months ago

That is strange, because the original content is still present in the inpainted frame. If the mask covers all of the unwanted content in every frame, that content should not appear in the result. Since FGT propagates content across all the frames of the video, please check the masks carefully in every frame. I suspect some of the content is not covered in some of the frames, which degrades the result.
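A quick way to run such a check is sketched below. It is not part of FGT: the helper names, the matching of frames to masks by file stem, and the 0.5 ratio threshold are all assumptions made for illustration. The sketch only uses the Python standard library, so counting the masked pixels per frame (for example with Pillow or OpenCV) is left to the reader.

```python
from pathlib import Path
from statistics import median

# Assumption: frames and masks are stored as ordinary image files.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def list_stems(folder):
    """Return the sorted file stems of all image files in `folder`."""
    return sorted(
        p.stem for p in Path(folder).iterdir()
        if p.suffix.lower() in IMAGE_SUFFIXES
    )


def unmatched(frame_dir, mask_dir):
    """Return (frames without a mask, masks without a frame).

    Assumption: a frame and its mask share the same file stem,
    e.g. images/00012.jpg and masks/00012.png.
    """
    frames = set(list_stems(frame_dir))
    masks = set(list_stems(mask_dir))
    return sorted(frames - masks), sorted(masks - frames)


def suspicious_masks(areas, ratio=0.5):
    """Return indices of frames whose mask looks empty or too small.

    `areas` holds the number of masked pixels per frame, in frame order.
    A frame is flagged when its area is zero or below `ratio` times the
    median area of the whole sequence.
    """
    if not areas:
        return []
    typical = median(areas)
    return [i for i, a in enumerate(areas) if a == 0 or a < ratio * typical]
```

With the folders from the command above, `unmatched("images", "masks")` would list frames that have no mask at all, and `suspicious_masks(areas)` would point to frames whose mask is empty or much smaller than usual. A flagged frame is only a hint to inspect by eye, since an object can legitimately shrink or leave the picture.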

wxllyf commented 11 months ago

Thank you very much. I will compare the masks frame by frame and find out what went wrong.