opendatalab/MinerU
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool.
https://opendatalab.com/OpenSourceTools?tool=extract
GNU Affero General Public License v3.0
18.22k stars
1.31k forks
issues
Running the magic-pdf command fails with an OpenBLAS thread limit error
#1019
Muyi030
opened
4 days ago
1
refactor(para): adjust right margin threshold based on block width
#1018
myhloli
closed
5 days ago
0
ppocr DEBUG: is this an error?
#1017
sanwacompany
closed
4 days ago
2
build(setup): add old_linux specific dependencies
#1016
myhloli
closed
5 days ago
0
ERROR: detectron2-0.6-cp310-cp310-macosx_10_9_universal2.whl is not a supported wheel on this platform.
#1015
CyberAsteroid
closed
5 days ago
2
[QA] MinerU formula post-processing issue
#1014
dt-yy
closed
2 days ago
1
refactor(para): improve paragraph splitting logic
#1013
myhloli
closed
5 days ago
0
add DocLayout-YOLO url
#1012
qiangqiang199
closed
5 days ago
1
add Doclayout-yolo url
#1011
qiangqiang199
closed
5 days ago
1
feat(ocr): improve handling of angled text boxes
#1010
myhloli
closed
5 days ago
0
Feature request: heading recognition and code recognition
#1009
Tian14267
closed
4 days ago
6
FastAPI PDF parsing endpoint: where can I find the parsed md files and images?
#1008
asenasen123
opened
5 days ago
0
Header and footer parsing issue
#1007
zhongxin129
opened
5 days ago
0
fix: using new data api replace old rw api
#1006
icecraft
closed
4 days ago
0
Incorrect result returned when deployed with FastAPI
#1005
asenasen123
opened
5 days ago
0
Note: CentOS 7 is not supported because newer versions of albumentations depend on simsimd
#1004
myhloli
closed
1 day ago
0
Cannot access Hugging Face from an internal network
#1002
yq-warehouse
closed
5 days ago
24
refactor(tests): extract common test utilities into test_commons.py
#1001
myhloli
closed
5 days ago
0
Is CentOS 7 currently supported?
#1000
Muyi030
closed
2 days ago
7
`unimernet` CustomMBartDecoder does not support Flash Attention 2
#999
sepcnt
opened
5 days ago
0
test(unitest): Restore unit test cases
#998
myhloli
closed
5 days ago
0
Error downloading the precompiled package with the command from the Quick CPU Demo
#997
yq-warehouse
closed
5 days ago
4
How do I use RapidTable? Changing the config file has no effect
#996
charliedream1
closed
5 days ago
24
Out-of-memory error after starting the project in Django
#995
haoweiwang0
closed
5 days ago
1
MinerU cannot recognize multi-level headings; all detected headings are classified as level-1 headings
#994
JoshonSmith
closed
5 days ago
2
Good
#993
Davidjennison1
closed
5 days ago
0
Reproduction case for PaddlePaddle-related issues
#992
phlrain
opened
5 days ago
1
Post in thread 'Boba's Dakar Yellow E46 M3 to CSL look-a-likey'
#991
Davidjennison1
closed
5 days ago
0
3 requirements files are there which one should use
#990
Akshaybhure111
closed
5 days ago
1
how have you processed the blocks after finding out the layout order?
#989
vikas-singh16
closed
2 days ago
2
T
#988
Davidjennison1
closed
6 days ago
0
Error related to script
#987
Akshaybhure111
closed
6 days ago
9
update ci
#986
dt-yy
closed
5 days ago
0
[QA] Version 0.9.3: with the config changed to table-master, tables in the generated md are output as images
#985
dt-yy
closed
6 days ago
1
[QA] Version 0.9.3: words run together
#983
dt-yy
closed
6 days ago
1
[QA] Version 0.9.0: extra spaces before and after inline formulas
#982
dt-yy
closed
6 days ago
1
[QA] MinerU 0.9.0 API version: error downloading models from Hugging Face
#981
dt-yy
closed
6 days ago
1
argument expect 3 but 4 given
#980
Akshaybhure111
closed
6 days ago
2
Could MLX be supported?
#979
yibie
opened
1 week ago
2
Request: add an option to control the output structure
#978
yibie
closed
1 week ago
1
docs: update readme
#977
myhloli
closed
1 week ago
0
Dev to 0.9.3
#976
myhloli
closed
1 week ago
0
docs: update feature description for table conversion
#975
myhloli
closed
1 week ago
0
docs: improve GPU support list formatting in README_zh-CN.md
#974
myhloli
closed
1 week ago
0
docs(README): update GPU hardware recommendations and table recognition options
#973
myhloli
closed
1 week ago
0
magic_pdf.user_api:parse_pdf:97 - string index out of range
#972
yibie
closed
1 week ago
5
fix: resolve issue opendatalab#715
#971
LollipopsAndWine
closed
1 week ago
0
Support for Python 3.11 and later?
#970
stevenhe1988
opened
1 week ago
2
Release 0.9.3
#969
myhloli
closed
1 week ago
1
Dev to 0.9.3
#968
myhloli
closed
1 week ago
0