opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
https://opendatalab.com/OpenSourceTools?tool=extract
GNU Affero General Public License v3.0
17.63k stars 1.28k forks source link

Config File issue #680

Closed Akhilesh-pandey1 closed 1 month ago

Akhilesh-pandey1 commented 1 month ago

is this config file structure is correct ? """ { "models-dir": "D:\work_easier_project\AdityaOCR\models", "device-mode": "cpu", "table-config": { "model": "TableMaster", "is_table_recog_enable": false, "max_time": 400 }, "ocr-config": { "enabled": false, "language": "eng" } } """

if not then please anyone give me correct structure, in which tell me how to specify OCR ? And what what thing I can specify with config file, tell me.

myhloli commented 1 month ago

you can't set ocr config in this json. we use cli args to config ocr mode. you can use magic-pdf --help to know how to use cli config.