VikParuchuri / marker

Convert PDF to markdown quickly with high accuracy
https://www.datalab.to
GNU General Public License v3.0
14.65k stars 763 forks source link

can you offer the train_data of layout segmenter model? #89

Closed codeants2012 closed 4 months ago

VikParuchuri commented 4 months ago

https://huggingface.co/datasets/vikp/doclaynet_processed

codeants2012 commented 4 months ago

Is there a similar dataset for Chinese?