VIPL-SLP / VAC_CSLR

Visual Alignment Constraint for Continuous Sign Language Recognition. ( ICCV 2021)
https://openaccess.thecvf.com/content/ICCV2021/html/Min_Visual_Alignment_Constraint_for_Continuous_Sign_Language_Recognition_ICCV_2021_paper.html
Apache License 2.0
116 stars 19 forks source link

Video augmentation methods for Pre-trained model #11

Open Aayush2007 opened 2 years ago

Aayush2007 commented 2 years ago

What are the video augmentation options used in the pre-trained model ([Dropbox]) ? In the code I can see that these are the ones uncommented, is that the case for the pretrained model? dataset/dataloader_video.py ksnip_20220225-151108

ycmin95 commented 2 years ago

Yes,as mentioned in our paper, we adopted three kinds of data augmentation strategies (random crop, horizontal flip and random temporal scaling) during training.

atonyo11 commented 2 weeks ago

@ycmin95, Why you choose horizontal flip methods? Does it change the meaning of sign?

ycmin95 commented 2 weeks ago

Relevant information can be found in the Table 3. (Ablation results of augmentation) of the supplymentary material, which can be download from: https://openaccess.thecvf.com/content/ICCV2021/supplemental/Min_Visual_Alignment_Constraint_ICCV_2021_supplemental.zip