xmu-xiaoma666 / LSTNet

Towards Local Visual Modeling for Image Captioning
MIT License
26 stars, 6 forks