Open det-tu opened 1 year ago
Hi, COCO and VG are easy to download. For SBU, CC3M and CC12M, you can refer to https://github.com/rom1504/img2dataset.
Thanks~
Could you provide your downloading scripts for SBU, CC3M and CC12M? I cannot align my dataset format with your readme through https://github.com/rom1504/img2dataset.
Describe Model I am using (UniLM, MiniLM, LayoutLM ...): VLMO/BEiTv3
Is there any chance to share pre-training datasets used in VLMO/BEiTv3 through Baidu Net Disk or Google Cloud, as many image urls are inaccessible now. Thanks.