opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
https://opendatalab.com/OpenSourceTools?tool=extract
GNU Affero General Public License v3.0
18.18k stars 1.3k forks source link

fix: using new data api replace old rw api #1006

Closed icecraft closed 4 days ago

icecraft commented 5 days ago

Thanks for your contribution and we appreciate it a lot. The following instructions would make your pull request more healthy and more easily get feedback. If you do not understand some items, don't worry, just make the pull request and seek help from maintainers.

Motivation

  1. replace old rw module with new data_reader_writer module
  2. fix multiple test_cases

Modification

  1. tools/
  2. projects/gradio
  3. pipe/
  4. user_api
  5. fix multiple test_cases (test_data, test_tools, test_integrations, test_model)

Use cases (Optional)

If this PR introduces a new feature, it is better to list some use cases here and update the documentation.

Checklist

Before PR:

After PR: