dear author:
for get cls_seq_test, every region corresponding to detection features,it using detection class with caption,but testing time,we will not know ground truth captions, I did not understanding.
thank you, very much,for help,my mail is 2630147239@qq.com
dear author: for get cls_seq_test, every region corresponding to detection features,it using detection class with caption,but testing time,we will not know ground truth captions, I did not understanding. thank you, very much,for help,my mail is 2630147239@qq.com