microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
https://aka.ms/GeneralAI
MIT License
19.49k stars 2.48k forks source link

[MarkupLM] Code for pre-training #918

Open ilyalasy opened 1 year ago

ilyalasy commented 1 year ago

Hi, is it possible to have access to original training code of MarkupLM (CommonCrawl preprocess, tags masking, etc.) ?

yash0307 commented 1 year ago

Hi, did you come across the code?

ilyalasy commented 1 year ago

Hi, did you come across the code?

Hi, no, I think its proprietary