This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as input a video and generates a caption in English describing the video.
Hi, Looking at the feature extraction part, I cannot see if you have reversed the channels to BGR from the RGB frames. Have you trained the VGG network on RGB? Please confirm.
Hi, Looking at the feature extraction part, I cannot see if you have reversed the channels to BGR from the RGB frames. Have you trained the VGG network on RGB? Please confirm.