About training set used in portrait-net for video

anilsathyan7 / Portrait-Segmentation

Real-time portrait segmentation for mobile devices

MIT License


About training set used in portrait-net for video #6

Closed: fangchao closed this issue 4 years ago

fangchao commented 4 years ago

Congrats on the awesome work, and thanks for sharing. I want to train a portrait-net for video. So far the performance is not as good as the usual semantic segmentation net with 3 input channels. Could you tell me how many empty previous masks and how many augmented previous masks are in your training set? Thanks.

anilsathyan7 commented 4 years ago

It's done on the fly as part of data augmentation (each variant is applied with some probability), so the training set has no fixed count of either. As far as video stability is concerned, this approach seems to be better than 3-channel inputs. The training dataset consists mostly of close-up portraits. If we train on a bigger, more varied dataset it may work better, I suppose. Please look at the original paper for more information. Also make sure you use proper normalization, preprocessing, etc.
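For readers who want something concrete, here is a minimal sketch of how a previous mask could be generated on the fly for a 4-channel input. It is not the repository's code: the function names, the `p_empty` probability, the `max_shift` range, and the use of a plain translation as the perturbation are all assumptions made for illustration. The image is assumed to be already normalized.

```python
import numpy as np


def shift_mask(mask, dy, dx):
    """Translate a 2-D mask by (dy, dx) pixels, filling exposed borders with zeros."""
    h, w = mask.shape
    out = np.zeros_like(mask)
    src_y = slice(max(0, -dy), min(h, h - dy))
    dst_y = slice(max(0, dy), min(h, h + dy))
    src_x = slice(max(0, -dx), min(w, w - dx))
    dst_x = slice(max(0, dx), min(w, w + dx))
    out[dst_y, dst_x] = mask[src_y, src_x]
    return out


def make_prior_mask(gt_mask, rng, p_empty=0.5, max_shift=8):
    """Simulate a previous-frame mask from the ground-truth mask.

    p_empty and max_shift are hypothetical values, not taken from the repo.
    """
    # Empty prior: mimics the first frame of a video, where no mask exists yet.
    if rng.random() < p_empty:
        return np.zeros_like(gt_mask)
    # Perturbed prior: mimics small motion between consecutive frames.
    dy, dx = rng.integers(-max_shift, max_shift + 1, size=2)
    return shift_mask(gt_mask, int(dy), int(dx))


def make_input(image, gt_mask, rng, p_empty=0.5, max_shift=8):
    """Stack a normalized HxWx3 image with a simulated prior mask into HxWx4."""
    prior = make_prior_mask(gt_mask, rng, p_empty, max_shift)
    prior = prior.astype(image.dtype)[..., None]
    return np.concatenate([image, prior], axis=-1)


# Usage with dummy data
rng = np.random.default_rng(0)
image = rng.random((64, 64, 3), dtype=np.float32)
gt_mask = np.zeros((64, 64), dtype=np.float32)
gt_mask[16:48, 16:48] = 1.0
x = make_input(image, gt_mask, rng)
```

Because the prior mask is drawn fresh for every sample, the ratio of empty to perturbed masks is controlled by a probability rather than by a fixed count in the dataset.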

fangchao commented 4 years ago

@anilsathyan7 I've got it. Thanks for your reply.