JDAI-CV / image-captioning

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
268 stars 52 forks source link

Ensemble code and how to use senet154 #29

Closed searchstorm closed 2 years ago

searchstorm commented 2 years ago

你好,请问可以提供ensemble的代码吗,还有论文说的以senet154为backbone是指将faster rcnn的backbone换成了senet 还是直接用它提取特征?有提取好的特征数据吗?

YehLi commented 2 years ago

The senet154 features have been released(Data preparation -- The pretrained SENet-154 model can be downloaded here)

You can refer to the code for ensemble (https://github.com/YehLi/xmodaler/blob/master/xmodaler/modeling/decode_strategy/ensemble_beam_searcher.py)