laura-wang / video_repres_sts

PyTorch implementation of our T-PAMI 2021 paper: Self-supervised Video Representation Learning by Uncovering Motion and Appearance Statistics

Self-Supervised Video Representation Learning by Uncovering Spatio-temporal Statistics

PyTorch implementation of our T-PAMI 2021 paper "Self-Supervised Video Representation Learning by Uncovering Spatio-temporal Statistics", a journal extension of our preliminary work presented at CVPR 2019. The journal version adds extensive analysis and improves performance significantly, by nearly 30%.

The TensorFlow implementation of our earlier CVPR 2019 work is available at: https://github.com/laura-wang/video_repres_mas.

Overview

Framework of the proposed approach.

Given an unlabeled video clip, the network regresses 14 motion statistical labels and 13 appearance statistical labels. These labels characterize, among other things, the spatial location and dominant direction of the largest motion, and the spatial location and dominant color of the largest color diversity along the temporal axis.
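
To make the motion labels more concrete, the sketch below derives two of them, the location of the largest motion and its dominant direction, from precomputed optical flow. It is a simplified illustration and not the label computation used in the paper or in this repository: the function name `largest_motion_label`, the 2x2 spatial grid, the 8 direction bins, and the use of raw flow magnitude are all assumptions made for the example. It is written in plain Python to stay dependency-free; a real pipeline would more likely operate on NumPy arrays or PyTorch tensors.

```python
import math


def largest_motion_label(flow_x, flow_y, grid=2, num_bins=8):
    """Return (block_index, direction_bin) for one clip.

    flow_x, flow_y: nested lists shaped [T][H][W] with the horizontal and
    vertical optical-flow components of each frame (hypothetical layout).
    block_index counts grid cells row by row, starting at the top-left.
    If the clip contains no motion at all, (0, 0) is returned.
    """
    height, width = len(flow_x[0]), len(flow_x[0][0])
    num_blocks = grid * grid
    block_energy = [0.0] * num_blocks
    block_hist = [[0.0] * num_bins for _ in range(num_blocks)]
    bin_width = 2 * math.pi / num_bins

    for frame_x, frame_y in zip(flow_x, flow_y):
        for row in range(height):
            for col in range(width):
                dx, dy = frame_x[row][col], frame_y[row][col]
                magnitude = math.hypot(dx, dy)
                if magnitude == 0.0:
                    continue
                # Map the pixel to its grid cell.
                block = (row * grid // height) * grid + (col * grid // width)
                # Quantize the flow angle into one of num_bins directions.
                angle = math.atan2(dy, dx) % (2 * math.pi)
                direction = int(angle / bin_width) % num_bins
                # Accumulate magnitude over the whole clip (temporal axis).
                block_energy[block] += magnitude
                block_hist[block][direction] += magnitude

    best_block = max(range(num_blocks), key=block_energy.__getitem__)
    histogram = block_hist[best_block]
    best_direction = max(range(num_bins), key=histogram.__getitem__)
    return best_block, best_direction
```

The two returned integers play the role of regression targets in this toy setting: a network that sees only the RGB clip would be trained to predict them, which forces it to attend to where and how things move.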

Requirements

Usage

Data preparation

UCF101 dataset

Train

python train.py --rgb_prefix RGB_DIR --flow_x_prefix FLOW_X_DIR --flow_y_prefix FLOW_Y_DIR
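
The command above can also be assembled programmatically, which makes it easy to fail early when an input location is wrong. The helper below is a minimal sketch under stated assumptions: `build_train_command` is a hypothetical name that is not part of this repository, and it assumes that `RGB_DIR`, `FLOW_X_DIR` and `FLOW_Y_DIR` are existing directories holding the RGB frames and the two optical-flow components. Only the three flags shown above are used.

```python
import os
import sys


def build_train_command(rgb_dir, flow_x_dir, flow_y_dir, script="train.py"):
    """Assemble the training command as an argument list.

    Raises FileNotFoundError if any of the three input directories is
    missing, so the problem surfaces before training starts.
    """
    inputs = {
        "--rgb_prefix": rgb_dir,
        "--flow_x_prefix": flow_x_dir,
        "--flow_y_prefix": flow_y_dir,
    }
    missing = [path for path in inputs.values() if not os.path.isdir(path)]
    if missing:
        raise FileNotFoundError("missing input directories: " + ", ".join(missing))

    # Use the current interpreter so the active environment is respected.
    command = [sys.executable, script]
    for flag, path in inputs.items():
        command += [flag, path]
    return command
```

The returned list can be passed to `subprocess.run(command, check=True)` from the repository root.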

TODO

Feature evaluation

Citation

If you find this repository useful in your research, please consider citing:

@Article{wang2021self,
author = {Wang, Jiangliu and Jiao, Jianbo and Bao, Linchao and He, Shengfeng and Liu, Yunhui and Liu, Wei},
title = {Self-Supervised Video Representation Learning by Uncovering Spatio-temporal Statistics},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
year = {2021}
}
@inproceedings{wang2019self,
  title={Self-Supervised Spatio-Temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics},
  author={Wang, Jiangliu and Jiao, Jianbo and Bao, Linchao and He, Shengfeng and Liu, Yunhui and Liu, Wei},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={4006--4015},
  year={2019}
}