Open Jennifer-6 opened 3 years ago
yes. The text input use the same word embedding.
Hi, Where can I find list of all object tags used in OSCAR? I am assuming OSCAR+ uses same dictionary given with VinVL. But I am not sure about the dictionary of OSCAR.
Is word embedding of object tags the same embedding matrix as word embedding of caption