e-bug / volta

[TACL 2021] Code and data for the framework in "Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs"
https://aclanthology.org/2021.tacl-1.58/
MIT License
113 stars 24 forks source link

vg-gqa_boxes36.lmdb file #16

Closed nooralahzadeh closed 2 years ago

nooralahzadeh commented 2 years ago

Hi, Is it possible to make "vg-gqa_boxes36.lmdb" file available? It seems the h5 version is available to download!

Thanks

e-bug commented 2 years ago

Hi,

I found that uploading the LMDB files made their size in the order of TBs. So, I just uploaded the H5 version, and you can quickly convert it to LMDB using this script.

Let me know if you have any issues!

nooralahzadeh commented 2 years ago

Hi, Thanks for the quick reply. The script creates an lmdb folder which includes data.mdb and lock.mdb. Which one I should point to in the task_config file?

e-bug commented 2 years ago

Just the lmdb folder :)