Open AnjumJ123 opened 2 years ago
Not sure why it is looking for dev.jsonl when that is not one of the .jsonl files in the hateful_memes dataset. Any suggestions to address this error?
!mmf_run config="projects/visual_bert/configs/hateful_memes/from_coco.yaml" \
  model=visual_bert \
  dataset=hateful_memes \
  run_type=train_val \
  training.log_interval=200 \
  training.max_updates=22000 \
  training.batch_size=64 \
  training.evaluation_interval=200 \
  training.tensorboard=True \
  training.checkpoint_interval=200 \
  checkpoint.resume_pretrained=True \
  checkpoint.resume_zoo=visual_bert.pretrained.coco \
  dataset_config.hateful_memes.annotations.train[0]="hateful_memes/defaults/annotations/train.jsonl" \
  dataset_config.hateful_memes.annotations.val[0]="hateful_memes/defaults/annotations/dev_unseen.jsonl" \
  dataset_config.hateful_memes.annotations.test[0]="hateful_memes/defaults/annotations/test_unseen.jsonl"
Error:
You can disable this warning by setting the environment variable OC_DISABLE_DOT_ACCESS_WARNING=1
warnings.warn(message=msg, category=UserWarning)
Overriding option config to projects/visual_bert/configs/hateful_memes/from_coco.yaml
Overriding option model to visual_bert
Overriding option datasets to hateful_memes
Overriding option run_type to train_val
Overriding option training.log_interval to 200
Overriding option training.max_updates to 22000
Overriding option training.batch_size to 64
Overriding option training.evaluation_interval to 200
Overriding option training.tensorboard to True
Overriding option training.checkpoint_interval to 200
Overriding option checkpoint.resume_pretrained to True
Overriding option checkpoint.resume_zoo to visual_bert.pretrained.coco
Using seed 30549397
Logging to: ./save/logs/train_2022-04-20T16:33:30.log
Downloading features.tar.gz: 100% 8.44G/8.44G [06:24<00:00, 22.0MB/s]
Traceback (most recent call last):
File "/usr/local/bin/mmf_run", line 8, in
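One way to narrow down where the dev.jsonl reference comes from is to search the configuration files that the run loads for that string. The following is a minimal sketch, not part of MMF: the function name `find_references` and the directory passed to it are hypothetical, and it assumes a local copy of the config tree is available to scan.

```python
from pathlib import Path


def find_references(root, needle="dev.jsonl", suffixes=(".yaml", ".yml")):
    """Return (path, line_number, line) for every config line that mentions needle.

    `root` is a hypothetical local directory containing the YAML configs;
    adjust it to wherever the configs actually live on your machine.
    """
    hits = []
    for path in sorted(Path(root).rglob("*")):
        # Only look at regular files with a YAML extension.
        if not path.is_file() or path.suffix not in suffixes:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            if needle in line:
                hits.append((str(path), number, line.strip()))
    return hits


# Example call (the path is an assumption, not a documented location):
# for hit in find_references("path/to/configs"):
#     print(hit)
```

Because the match is a plain substring test, a line that points at dev_unseen.jsonl is not reported; only lines that still name dev.jsonl itself show up.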
I am trying to reproduce the code for running ViLBERT on the Hateful Memes dataset, but the existing code needs to be modified to point to the new data source for the Hateful Memes Challenge data, which then needs to be linked in the code.
Facebook released a webpage (https://hatefulmemeschallenge.com/) where the dataset can be downloaded. Would you be able to change the notebook slightly so that the code can be reproduced? I am having trouble using mmf_convert_hm to convert the Hateful Memes data into the MMF format and to move the images and .jsonl files into the folders the repo expects. I am also unsure how to satisfy the MMF prerequisites after downloading the dataset, and which changes, if any, are needed in the .yaml and config files to make the code work.
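Before changing any .yaml files, it can help to confirm that the annotation files ended up where the run expects them. The sketch below is an illustration under assumptions, not MMF's own check: `check_annotations` is a hypothetical helper, the directory you pass in is wherever your converted data actually landed, and the default file names are simply the three passed as overrides in the command earlier in this thread. It only verifies that each file exists and that every non-empty line parses as JSON; it does not check images, features, or any other MMF prerequisite.

```python
import json
from pathlib import Path


def check_annotations(
    annotation_dir,
    expected=("train.jsonl", "dev_unseen.jsonl", "test_unseen.jsonl"),
):
    """Report, per expected file, whether it exists and how many lines parse as JSON.

    `annotation_dir` is an assumption: point it at the folder that holds
    the .jsonl files after conversion on your machine.
    """
    report = {}
    for name in expected:
        path = Path(annotation_dir) / name
        if not path.is_file():
            report[name] = "missing"
            continue
        records = 0
        unparseable = 0
        with path.open(encoding="utf-8") as handle:
            for line in handle:
                if not line.strip():
                    continue  # ignore blank lines
                try:
                    json.loads(line)
                    records += 1
                except json.JSONDecodeError:
                    unparseable += 1
        report[name] = f"{records} records, {unparseable} unparseable lines"
    return report


# Example call (hypothetical path):
# for name, status in check_annotations("path/to/annotations").items():
#     print(name, "->", status)
```

A file reported as "missing" here suggests the move step did not place it in that folder, which is worth ruling out before editing the dataset config.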