This repo contains some of the codes for the following paper Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.
Now this repo only contains codes for SAT, BUTD and CNN-C as was written in the paper.
evalcap folder can be downloaded from here.
To run the code for training, do sh train.sh. To test, sh test.sh
I kept having bad results for CNN-C model, with all the generations in the val set be the same. I had the same issue when I tried to adopt from self-critical.pytorch. This never happened before when I ran the experiments for the ECCV paper. I really appreciate if anyone find the reason why this happened.
To get the preprocessed data, use this or email: Xuewen Yang @ xuewen.yang@innopeaktech.com if you need the raw data. (The original data is unfortunately broken now. I still have them, keeping them in a hard drive.)
For other issues, please create an issue on this repo.
If you want to download the original dataset (some data might be missing), you can:
If you use this data, please cite:
@inproceedings{XuewenECCV20Fashion,
Author = {Xuewen Yang and Heming Zhang and Di Jin and Yingru Liu and Chi-Hao Wu and Jianchao Tan and Dongliang Xie and Jue Wang and Xin Wang},
Title = {Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards},
booktitle = {ECCV},
Year = {2020}
}