PeterL1n / BackgroundMattingV2

Real-Time High-Resolution Background Matting
MIT License

Doesn't work on any of my videos #66

Closed. ecsplendid closed this issue 3 years ago.

ecsplendid commented 3 years ago

When I use your src.mp4, i.e.

!gdown https://drive.google.com/uc?id=1tCEk8FE3WGrr49cdL8qMCqHptMCAtHRU -O /content/src.mp4 -q

it works great.

[image]

However, when I use one of my videos (H.264, 1080p), it doesn't work at all.

This is the alpha:

[image]

From this input:

[image]

From running this command (note that I also get an error message, but a video is still produced):

[image]

ecsplendid commented 3 years ago

Here is the video I made to reproduce the problem:

!gdown https://drive.google.com/uc?id=1q2MzIcsXIknxzKJSms9VxU2icVz3YV8R -O /content/tim.mp4 -q

I can reproduce it in your Google Colab too, so it's not my machine/environment.

PeterL1n commented 3 years ago

Our model is called Background Matting, which requires you to supply an additional background capture of the scene without the subject. You don't seem to have provided a correctly pre-captured background through the --video-bgr argument; you are still using the demo bgr.png, which is definitely not correct.
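
A minimal sketch of a correct invocation, using the flags that appear later in this thread; the file names below are placeholders for your own video, your own background capture, and the downloaded checkpoint:

# paths are placeholders; my_background.png must be a still of the same scene
# and framing as the video, taken with the subject out of frame
python3 inference_video.py \
    --model-type mattingrefine \
    --model-backbone resnet50 \
    --model-backbone-scale 0.25 \
    --model-refine-mode sampling \
    --model-refine-sample-pixels 80000 \
    --model-checkpoint ./pytorch_resnet50.pth \
    --video-src ./my_video.mp4 \
    --video-bgr ./my_background.png \
    --output-dir ./output \
    --output-type com fgr pha err

The key point is that --video-bgr must be your own background capture, not the demo bgr.png.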

ecsplendid commented 3 years ago

Oh! My bad, sorry, I feel dumb now -- I should have read your paper before trying this. I think I read BGR as blue, green, red rather than "background", lol.

Do you think it would be possible to use your refiner idea on top of a standard salient-object segmentation model and skip the need for the background photo? Does it confer a huge improvement?

Edit -- I've got it working and the results are really nice!

Machine utilisation: [image]

PeterL1n commented 3 years ago

You can definitely apply the refiner idea to segmentation networks, but that requires you to experiment and train your own network. It will definitely allow processing higher-resolution images with far fewer resources than passing everything through a standard conv-net, but it may not be able to recover all the hair details without a background.

fangchao commented 3 years ago

> Our model is called Background Matting, which requires you to supply an additional background capture of the scene without the subject. You don't seem to have provided a correctly pre-captured background through the --video-bgr argument; you are still using the demo bgr.png, which is definitely not correct.

Hi @PeterL1n, I have supplied an additional background capture without the subject, taken with my mobile phone, but it still doesn't work. Is there anything else I need to modify or supply?

Thanks a lot.

"inference_video.sh" 17L, 623C 1,1 All python3 inference_video.py \ --model-type mattingbase \ --model-backbone resnet50 \ --model-backbone-scale 0.25 \ --model-refine-mode sampling \ --model-refine-sample-pixels 80000 \ --model-checkpoint ./PyTorch/pytorch_resnet50.pth \ --video-src "./1615342262545425.mp4" \ --video-bgr "./WechatIMG125.jpeg" \ --video-resize 1920 1080 \ --output-dir "./test" \ --output-type com fgr pha err

This is my video-bgr (WechatIMG125): [image]

PeterL1n commented 3 years ago

Change --model-type to mattingrefine.
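
That is, the only change needed in the script above is the model type; the --model-refine-* options you already pass are intended for the refine model:

# in inference_video.sh, replace
    --model-type mattingbase \
# with
    --model-type mattingrefine \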

Please provide a video frame, the background image, and the model output so I can debug.

swelcker commented 3 years ago

I think the background needs to have the same size/scale as the input video frames.
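
One way to check and fix a mismatch, assuming ffmpeg and ffprobe are installed (file names taken from the script above):

# print the video's resolution, e.g. 1920,1080
ffprobe -v error -select_streams v:0 -show_entries stream=width,height -of csv=p=0 ./1615342262545425.mp4

# rescale the background photo to that resolution
# (better still, recapture the background with the same camera and framing as the video)
ffmpeg -i ./WechatIMG125.jpeg -vf scale=1920:1080 ./WechatIMG125_1080p.png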