================================================================================
This repository contains the code for our CVPR 2016 paper:
Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman
"Convolutional Two-Stream Network Fusion for Video Action Recognition"
in Proc. CVPR 2016
If you find the code useful for your research, please cite our paper:
@inproceedings{feichtenhofer2016convolutional,
title={Convolutional Two-Stream Network Fusion for Video Action Recognition},
author={Feichtenhofer, Christoph and Pinz, Axel and Zisserman, Andrew},
booktitle={Conference on Computer Vision and Pattern Recognition (CVPR)},
year={2016}
}
The code was tested on Ubuntu 14.04 and Windows 10 using MATLAB R2015b and NVIDIA Titan X or Z GPUs.
If you have questions regarding the implementation, please contact:
Christoph Feichtenhofer <feichtenhofer AT tugraz.at>
================================================================================
Download the code:
git clone --recursive https://github.com/feichtenhofer/twostreamfusion
Compile the code by running compile.m.
Edit the file cnn_setup_environment.m to adjust the models and data paths.
Download the pretrained model files and the datasets linked below, and unpack them into your models/data directory.
Run cnn_ucf101_spatial(); to train the appearance network stream.
Run cnn_ucf101_temporal(); to train the optical flow network stream.
Run cnn_ucf101_fusion(); this will use the downloaded models and demonstrate training of our final architecture on UCF101/HMDB51.
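The three calls above can be chained in a small driver script. The following is only a minimal sketch, not part of the released code: it assumes compile.m has already been run, cnn_setup_environment.m has been edited, and the repository folder is on the MATLAB path. The variable names stages and found are hypothetical.

```matlab
% Sketch: run the training stages in the order listed above.
% Assumption: the three entry points are called without arguments.
stages = {'cnn_ucf101_spatial', ...   % appearance network stream
          'cnn_ucf101_temporal', ...  % optical flow network stream
          'cnn_ucf101_fusion'};       % final fusion architecture

% Check that every entry point is visible before starting a long run.
% exist(name, 'file') returns 2 for a file found on the path.
found = cellfun(@(name) exist(name, 'file') == 2, stages);

if all(found)
    for i = 1:numel(stages)
        fprintf('Running %s ...\n', stages{i});
        feval(stages{i});
    end
else
    % Report what is missing instead of failing halfway through.
    fprintf('Not on the path: %s\n', strjoin(stages(~found), ', '));
end
```

Checking the path first is a convenience: a missing function is reported immediately instead of after the first stage has finished training.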
Relevant training options: opts.train.gpus and cudnnWorkspaceLimit (512MB is default).
Pre-computed optical flow images and resized RGB frames for the UCF101 and HMDB51 datasets