HHTseng / video-classification

Tutorial for video classification/ action recognition using 3D CNN/ CNN+RNN on UCF101
936 stars 216 forks source link

Possible CPU bottleneck #13

Open huangchaoxing opened 5 years ago

huangchaoxing commented 5 years ago

This is not an issue but I am still curious. Before feeding those features into the lstm, the CNN will have to extract the features by using a for loop, and this might slow down the process of training. Actually, when I am using your code to train on my own dataset, which contains more than 100000 images, it takes more than half an hour to train the resnet+LSTM for one epoch on 3 TITANX GPU(batch size = 48, num_workers=32). Do you have any idea how to improve the speed?