SE-STAD: A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector

Introduction

This repository hosts the official implementation of the paper "A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector".

We are still cleaning up the code and have released part of it so far. Our paper has been accepted to WACV 2023, and an arXiv version can be found at this link.

To-do list

Code Overview

Our code is based on MMAction2 and MMDetection2, with some major modifications.

Installation

To Reproduce Our Results on AVA

Download Features and Annotations

Training and Evaluation

cd mmaction2
./run/train/slowfast_r50_fcos.sh

References

If you use our code, please consider citing our paper.

@inproceedings{sui2023sestad,
  title={A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector},
  author={Sui, Lin and Zhang, Chen-Lin and Gu, Lixin and Han, Feng},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)},
  month={January},
  year={2023},
  pages={in press}
}