ChenRocks / UNITER

Research code for ECCV 2020 paper "UNITER: UNiversal Image-TExt Representation Learning"
https://arxiv.org/abs/1909.11740
777 stars 109 forks source link

the vocab file in uniter #56

Open wjy3326 opened 3 years ago

wjy3326 commented 3 years ago

What vocab file does the uniter model use? cased or uncased?

joeyy5588 commented 3 years ago

The default tokenizer in prepro.py is 'bert-base-cased'.