-
Hi @apsdehal
I wanted to check why there seems to be such high variance on the test set for Text BERT. I reproduce the results here. Can I clarify which test set and val set (seen or unseen?) the …
-
Using CLIP with VQGAN, I'm noticing a behaviour that I think originates in CLIP. Sometimes words in the parsed text are rendered as typography instead of being interpreted and rendered as an object/th…
-
So I previously had trouble reproducing the results using the pretrained models from the model zoo. That is fine now. I moved on to training the model myself and encountered problems reproducing …
-
## Feature
I was trying to use mmf_predict to create the submission result CSV file. I think the mmf_predict command changes the order of the test IDs. That's why I got "IDs for submission are not correct…
-
Hi, I have been working on inference using a number of the baseline models. I've gotten it to work well on image+text, but for VisualBert, because use_features is enabled, I need the features.
So I …
-
I have an issue reproducing the baselines in the Hateful Memes paper. Specifically, I am trying to get the baselines for Text BERT, but I am also not able to get the baselines for Image-Grid.
## Instruc…
-
I want to use the models ConcatBert, Late Fusion, Text Bert, Image Grid and Visualbert COCO for inference on Hateful Memes, by building a website around those models which can take a jpeg/png and the …
-
## ❓ Questions and Help
When I try to run any of the examples to train the models, I'm receiving some messages about the optimizer configuration.
If I run the following CLI command:
```
mmf_r…
```
-
## ❓ Questions and Help
Hi,
I was able to add my own dataset to mmf, however, when I try to train the unimodal image (Image Grid) model on it, I run into an error when evaluating the first epoch:
…
-
I found that the decoder weights are not loaded. Could you explain why, and how to load them?