Junction4Nako / mvp_pytorch

pytorch implementation of mvp: a multi-stage vision-language pre-training framework
MIT License
33 stars 8 forks source link

Question about id2phrase.json #2

Open Ammexm opened 2 years ago

Ammexm commented 2 years ago

Hello, could you please tell me how the format of id2phrase.json file is generated? What is the meaning of the data like "30522": [" Boat ", "two"] in that file? Is the data in it stored as "image_id: tuple"? Or does "30522" represent a randomly generated num?

Junction4Nako commented 2 years ago

30522 represents the embedding id of the phrase ["boat", "two"], which corresponds to the 30522nd embedding of the bert.embedding.word_embedding layer.