labyrinth7x / Deep-Cross-Modal-Projection-Learning-for-Image-Text-Matching

Deep Cross-Modal Projection Learning for Image-Text Matching

This is a PyTorch implementation of the paper Deep Cross-Modal Projection Learning for Image-Text Matching.
The official implementation in TensorFlow can be found here.

Requirements

Data Preparation

Training & Testing

First, set the model_path parameter to your current directory. Then run:

sh scripts/run.sh

The command above runs training and testing together, so you do not need to perform them separately.
To run training only:

sh scripts/train.sh

To run testing only:

sh scripts/test.sh
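
If you prefer to run the two stages separately but still stop as soon as one fails, a small wrapper can help. The following is a minimal sketch, not part of this repository: run_stage is a hypothetical helper name, and it assumes you run it from the repository root so that scripts/train.sh and scripts/test.sh resolve.

```shell
# Hypothetical helper (not shipped with this repository).
# Runs scripts/<stage>.sh and fails early if the script is missing.
run_stage() {
    script="scripts/$1.sh"
    if [ ! -f "$script" ]; then
        echo "missing $script (run from the repository root)" >&2
        return 1
    fi
    # The exit status of the stage script becomes the function's status.
    sh "$script"
}
```

With this defined, `run_stage train && run_stage test` would run testing only when training exits successfully.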