iflytek / VLE

VLE: Vision-Language Encoder (VLE: Vision-Language Multimodal Pre-trained Model)
Apache License 2.0

Datasets used in pre-training #7

Open gaoCleo opened 11 months ago

gaoCleo commented 11 months ago

Which datasets were used in the pre-training stage, especially for the patch box classification task?