[PR Welcome] More data augmentation methods

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

https://mmaction2.readthedocs.io

Apache License 2.0


[PR Welcome] More data augmentation methods #436

Open irvingzhang0512 opened 3 years ago

irvingzhang0512 commented 3 years ago

Describe the feature

More data augmentation methods.

Motivation

When training our own models, we need to TRY EVERYTHING...

Additional context

MMAction2's existing data augmentation methods:
- random flip
- various kinds of crop and resize
- color jitter (brightness/contrast/saturation/hue)

Plans

[x] imgaug #492
[x] mixup and cutmix #681
[x] RandAugment by imgaug #585
[x] PyTorchVideo transforms #1008
[ ] albumentations
[ ] videomix
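
For readers unfamiliar with the mixup item in the plan, here is a minimal NumPy sketch of mixup applied to a batch of video clips. It is not the code from #681; the function name `mixup_clips`, the `(N, T, H, W, C)` layout, and the default `alpha` are assumptions chosen for illustration.

```python
import numpy as np


def mixup_clips(clips, labels, alpha=0.8, rng=None):
    """Blend each clip with a randomly chosen partner from the same batch.

    clips:  float array of shape (N, T, H, W, C) (assumed layout)
    labels: one-hot or soft labels of shape (N, num_classes)
    """
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)        # mixing ratio in [0, 1]
    perm = rng.permutation(len(clips))  # partner index for every clip
    mixed_clips = lam * clips + (1.0 - lam) * clips[perm]
    mixed_labels = lam * labels + (1.0 - lam) * labels[perm]
    return mixed_clips, mixed_labels, lam
```

The same ratio `lam` is applied to every frame of a clip, so the blend is constant over time; only the labels and pixel values are interpolated.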

For more image augmentation methods, we could refer to PaddleClas.

innerlee commented 3 years ago

Thanks for the enthusiasm! Here is a practical approach:

1. Make a list of candidate augmentations. For each augmentation, find its original paper, together with a reference implementation.
2. Open a separate pull request for each item on the list.

You may use this issue to track progress.

dreamerlin commented 3 years ago

Yeah, thanks for your proposal. It would be better to provide a list of candidate augmentations so that we can also help implement some of them! @irvingzhang0512

irvingzhang0512 commented 3 years ago

This paper shows that random rotation may help.
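
A rough sketch of what random rotation could look like for video: the same angle is applied to every frame so the clip stays temporally consistent. This is an illustration only, not the method from the linked paper; `rotate_clip`, `random_rotate_clip`, the `(T, H, W, C)` layout, nearest-neighbour sampling, and zero fill are all assumptions.

```python
import numpy as np


def rotate_clip(clip, angle_deg):
    """Rotate every frame of a (T, H, W, C) clip by the same angle about the centre.

    Nearest-neighbour sampling; pixels that map outside the frame are set to 0.
    """
    _, h, w, _ = clip.shape
    theta = np.deg2rad(angle_deg)
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    dy, dx = ys - cy, xs - cx
    # Inverse mapping: for each output pixel, locate its source pixel.
    src_x = np.rint(np.cos(theta) * dx + np.sin(theta) * dy + cx).astype(int)
    src_y = np.rint(-np.sin(theta) * dx + np.cos(theta) * dy + cy).astype(int)
    valid = (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)
    out = np.zeros_like(clip)
    out[:, valid] = clip[:, src_y[valid], src_x[valid]]
    return out


def random_rotate_clip(clip, max_angle=15.0, rng=None):
    """Draw one angle in [-max_angle, max_angle] and apply it to all frames."""
    rng = np.random.default_rng() if rng is None else rng
    return rotate_clip(clip, rng.uniform(-max_angle, max_angle))
```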

innerlee commented 3 years ago

In image classification, rotation works to some extent, but color jittering sometimes does not.

Whether or not they work does not affect whether we implement them.

Edit: @dreamerlin had some code for color jittering.

innerlee commented 3 years ago

Supporting rotation would be great.

I also wanted to have some camera transformations, such as

- push in
- pull out
- pan
- tilt

Do you have interest in implementing them?
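
One way to approximate these camera moves is to slide a crop window across the clip and resize each crop to a fixed output size. The sketch below is a hypothetical illustration, not an MMAction2 API: `camera_move`, its box format, and the nearest-neighbour resize are assumptions.

```python
import numpy as np


def camera_move(clip, start_box, end_box, out_size):
    """Simulate a camera move by interpolating a crop window over time.

    clip:      array of shape (T, H, W, C) (assumed layout)
    start_box: (top, left, height, width) of the window on the first frame
    end_box:   (top, left, height, width) of the window on the last frame
    out_size:  (out_h, out_w) of every output frame
    """
    t, h, w, _ = clip.shape
    out_h, out_w = out_size
    start = np.asarray(start_box, dtype=float)
    end = np.asarray(end_box, dtype=float)
    frames = []
    for i in range(t):
        a = i / (t - 1) if t > 1 else 0.0
        top, left, box_h, box_w = (1.0 - a) * start + a * end
        # Nearest-neighbour resize of the window to the output size.
        rows = np.floor(top + (np.arange(out_h) + 0.5) * box_h / out_h)
        cols = np.floor(left + (np.arange(out_w) + 0.5) * box_w / out_w)
        rows = np.clip(rows, 0, h - 1).astype(int)
        cols = np.clip(cols, 0, w - 1).astype(int)
        frames.append(clip[i][rows[:, None], cols[None, :]])
    return np.stack(frames)
```

With this helper, a push in goes from the full frame to a smaller centred box, a pull out is the reverse, a pan changes only `left`, and a tilt changes only `top`.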

irvingzhang0512 commented 3 years ago

> Supporting rotation would be great.
>
> I also wanted to have some camera transformations, such as
>
> - push in
> - pull out
> - pan
> - tilt
>
> Do you have interest in implementing them?

@innerlee I'm a little busy until Jan. 20th. If no one has implemented these by then, I'll give it a try. Before that, I'll implement tsm-mobilenet and support imgaug/albumentations in the pipeline.

innerlee commented 3 years ago

No hurry.

There are also lots of movie transition effects that could do some fancy temporal "mixup".
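
As one possible reading of a temporal "mixup", the sketch below applies a cross-fade (dissolve) between two clips, where the mixing weight changes per frame instead of being constant. The function name `crossfade`, its parameters, and the idea of using the mean weight as a soft-label ratio are assumptions for illustration.

```python
import numpy as np


def crossfade(clip_a, clip_b, fade_start, fade_len):
    """Dissolve from clip_a into clip_b over fade_len frames (fade_len >= 1).

    Both clips are float arrays of shape (T, H, W, C). Returns the mixed clip
    and the per-frame weight of clip_b.
    """
    t = clip_a.shape[0]
    idx = np.arange(t)
    # Weight of clip_b: 0 up to fade_start, linear ramp, then 1.
    w = np.clip((idx - fade_start) / fade_len, 0.0, 1.0)
    w4 = w[:, None, None, None]
    mixed = (1.0 - w4) * clip_a + w4 * clip_b
    return mixed, w
```

If soft labels are wanted, `w.mean()` could serve as the share of `clip_b` in the mixed label, by analogy with mixup.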

irvingzhang0512 commented 3 years ago

Found an interesting paper here. It proposes a video augmentation strategy called VideoMix.
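
To make the idea concrete, here is a CutMix-style sketch for video in which a spatial box from one clip is pasted into another across every frame, and the labels are mixed by the fraction of replaced pixels. The box sampling and the name `cuboid_mix` are assumptions and may differ from what the paper actually does.

```python
import numpy as np


def cuboid_mix(clip_a, clip_b, label_a, label_b, rng=None):
    """Paste a random spatial box of clip_b into clip_a on all frames.

    Clips have shape (T, H, W, C) (assumed layout); labels are one-hot or soft vectors.
    """
    rng = np.random.default_rng() if rng is None else rng
    _, h, w, _ = clip_a.shape
    lam = rng.uniform(0.0, 1.0)  # target share of clip_a to keep
    cut_h = int(h * np.sqrt(1.0 - lam))
    cut_w = int(w * np.sqrt(1.0 - lam))
    cy, cx = int(rng.integers(h)), int(rng.integers(w))
    y0, y1 = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, h)
    x0, x1 = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, w)
    mixed = clip_a.copy()
    mixed[:, y0:y1, x0:x1] = clip_b[:, y0:y1, x0:x1]
    # Recompute the ratio from the box that was actually pasted (after clipping).
    lam = 1.0 - (y1 - y0) * (x1 - x0) / (h * w)
    mixed_label = lam * label_a + (1.0 - lam) * label_b
    return mixed, mixed_label, lam
```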

innerlee commented 3 years ago

haha

irvingzhang0512 commented 3 years ago

PyTorchVideo's random_resized_crop transform supports a shift mode, which looks like the movie-style transforms. Maybe we can support this. @innerlee @dreamerlin