aminullah6264 / Activity_Rec_ML-LSTM

Activity Recognition using Temporal Optical Flow Convolutional Features and Multi-Layer LSTM
https://sites.google.com/view/aminullah/home
25 stars 17 forks source link
action-recognition activity-recognition human-action-recognition human-activity-recognition lstm lstm-cnn lstm-networks

Activity Recognition using Temporal Optical Flow Convolutional Features and Multi-Layer LSTM

Paper

https://ieeexplore.ieee.org/document/8543495

Demo Results

https://www.youtube.com/watch?v=x58jSG8IvXQ&t=74s https://www.youtube.com/watch?v=cyhIVOnEAMg&t=6s

Compiling

First compile caffe, by configuring

"Makefile.config" (example given in Makefile.config.example)

then make with

$ make -j 5 all tools pycaffe 

Running

(this assumes you compiled the code sucessfully)

IMPORTANT: make sure there is no other caffe version in your python and system paths and set up your environment with:

$ source set-env.sh 

This will configure all paths for you. Then go to the model folder and download models:

$ cd models 
$ ./download-models.sh 

Features Extraction

Extract temporal optical flow features from activity recogntion datasets: *Activity recogntion datasets can be downloaded from the following Links

Training

First you need to prepare the training data using Features_Extraction.py

$ python scripts/Training_ML_LSTM.py 
Change path in code: Line No. 147

Testing

Testing video using trained multi-layer LSTM

$ scripts/Video_Testing.py 
Change paths: Line 40, 62, 63, 175

This code can only be used for research purposes:

Citation


Ullah, A., Muhammad, K., Baik, S. W. (2018). Activity Recognition using Temporal Optical Flow Convolutional Features and Multi-Layer LSTM. IEEE Transactions on Industrial Electronics.

Ullah, A., Ahmad, J., Muhammad, K., Sajjad, M., & Baik, S. W. (2018). Action Recognition in Video Sequences using Deep Bi-  Directional LSTM With CNN Features. IEEE Access, 6, 1155-1166.

Ilg E, Mayer N, Saikia T, Keuper M, Dosovitskiy A, Brox T. Flownet 2.0: Evolution of optical flow estimation with deep networks. InIEEE conference on computer vision and pattern recognition (CVPR) 2017 Jul 1 (Vol. 2, p. 6).