ChenRocks / UNITER

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
https://arxiv.org/abs/1909.11740
777 stars 109 forks source link

Vocabulary and single image-question pair prediction #46

Open foxm79 opened 3 years ago

foxm79 commented 3 years ago
  1. Is the vocabulary available that takes the words of the questions and converts them to 'input_ids'?
  2. Is there a function that does this for an input question?
  3. Is there a code that take a single image-question pair and predicts the answer?
tjulyz commented 3 years ago
  1. Refer to the prepro.py in scripts
foxm79 commented 3 years ago

Yes, that is what I followed eventually. Thanks for replying !