krasserm / fairseq-image-captioning

Transformer-based image captioning extension for pytorch/fairseq
Apache License 2.0
313 stars 56 forks source link

Support variable number of image feature vectors #1

Closed krasserm closed 4 years ago

krasserm commented 4 years ago
krasserm commented 4 years ago

I just committed an update that fixes an improper handling of the maximum number of source and target positions. These can be set for training to ensure that the number of image feature vectors and/or caption tokens do not exceed a given maximum number. Default is 64 image feature vectors and 1024 caption tokens.