krasserm / fairseq-image-captioning

Transformer-based image captioning extension for pytorch/fairseq
Apache License 2.0
312 stars 55 forks source link

Training end-to-end on my own dataset #26

Open MiriamFarber opened 3 years ago

MiriamFarber commented 3 years ago

I have my own dataset of (image, caption) pairs on which I'd like to train the model. Does this repository enables to do that without first extracting features/bounding boxes?

Can I do it via avoiding passing the flag --features ?