yiling2018 / saem

Learning Fragment Self-Attention Embeddings for Image-Text Matching, in ACM MM 2019
41 stars 7 forks source link