bobwan1995 / cliora

Official codebase for the ICLR oral paper *Unsupervised Vision-Language Grammar Induction with Shared Structure Modeling*

Hyperparams and setup for DIORA with random word embeddings on MSCOCO #1

Closed: roger-tseng closed this issue 2 years ago

roger-tseng commented 2 years ago

Hi Bo Wan,

Thanks for making your implementation publicly available! Could you also share the hyperparameters and setup you used for DIORA on MSCOCO without pretrained embeddings? Your work is the first I've seen showing DIORA work with randomly initialized word embeddings, and I believe many would benefit from being able to reproduce those results.

Thanks in advance! Roger

bobwan1995 commented 2 years ago

Thanks for your interest in this work! The hyperparameters for MSCOCO are as follows:

```bash
python cliora/scripts/train.py \
    --cuda \
    --max_epoch 30 \
    --arch mlp \
    --batch_size 32 \
    --emb none \
    --hidden_dim 400 \
    --k_neg 100 \
    --log_every_batch 100 \
    --lr 5e-3 \
    --normalize unit \
    --reconstruct_mode softmax \
    --train_filter_length 40 \
    --data_type coco \
    --train_path ./coco_data/train_gold_caps.json \
    --validation_path ./coco_data/val_gold_caps.json \
    --experiment_path $EXP_PATH
```

The most important hyperparameters are the batch size and the learning rate; the learning rate here is larger than the one used for Flickr30K. The JSON files come directly from VPCFG. I'll add the MSCOCO experiments soon.
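For anyone reproducing this, note that `$EXP_PATH` is the only variable the command above leaves unset. A minimal sketch of preparing it before launching; the directory name is an assumption for illustration, not a path from the repo:

```bash
# Hypothetical output directory; any writable path should work.
export EXP_PATH=./outputs/diora_coco_random_emb
mkdir -p "$EXP_PATH"
```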