karpathy / neuraltalk

NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
5.4k stars 1.32k forks source link

Size of Descriptive Sentences #20

Open cicero19 opened 9 years ago

cicero19 commented 9 years ago

Hi Andrej,

Is there a limit to the size of the descriptive sentences? Has it been tried with multiple sentences each describing different features of the image? For example, if an image had a descriptor "A dog in a park. A kite in the sky." could it generate two sentences if the training data was in a similar format? OR is it better to split the descriptive sentences into several single sentence examples and show the same image for each (ie. image A: dog in a park, image A: kite in the sky).

Also, is the matlab feature extractor GPU enabled?

Thanks!

EricZeiberg commented 9 years ago

I'm not sure about the first question, but caffe itself is GPU supported with the CUDA Nvidia libraries. Go here http://caffe.berkeleyvision.org/installation.html for more info