Datasets used in pretraining

iflytek / VLE

VLE: Vision-Language Encoder (VLE: a vision-language multimodal pre-trained model)

Apache License 2.0

184 stars 12 forks

Open gaoCleo opened 11 months ago

gaoCleo commented 11 months ago

Which datasets were used in the pre-training stage, especially for the patch box classification task?