ruotianluo / ImageCaptioning.pytorch

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch for archive)
MIT License
1.44k stars 415 forks source link

Chinese image caption, In the result, multiple words of the same type appear #95

Open cylvzj opened 4 years ago

cylvzj commented 4 years ago

Hello, I am using the COCO dataset, A two-layer LSTM model, one layer for top-down attention, and one layer for language models.

Extracting words with jieba I used all the words in the picture description that occurred more than 3 times as a dictionary file, and a total of 14,226 words. words = [w for w in word_freq.keys () if word_freq [w]> 3]

After training the model, when using it, multiple words of the same type appear in the result, such as:

Note notebook laptop computer on bed A little girl little girl girl standing together

How can I solve this problem?

huaifeng1993 commented 4 years ago

看看标签没有没做好呢,分词的时候cut_all设置为False ,jiaba.cut(,cut_all=False)这样设置。

xinli2008 commented 3 years ago

have you soloved this problem?