opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
https://opendatalab.com/OpenSourceTools
GNU Affero General Public License v3.0
11.2k stars 837 forks

PDF to TEXT #428

Open hzzheng0612 opened 1 month ago

hzzheng0612 commented 1 month ago

Is your feature request related to a problem? Please describe.

Is it possible for you guys to develop a PDF-to-TXT feature in the future?

drunkpig commented 1 month ago

@hzzheng0612 I think Markdown is just plain text. Could you describe your requirements in more detail?