issues
search
VikParuchuri
/
marker
Convert PDF to markdown quickly with high accuracy
https://www.datalab.to
GNU General Public License v3.0
16.8k
stars
954
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
numpy version mismatch for python 3.9 on mac
#294
rudysev
opened
2 hours ago
0
Integration Suggestion for OCR Wrapper
#293
X-T-E-R
opened
11 hours ago
0
Images fail to be extracted when converting multiple files
#292
Dylanfpv
opened
4 days ago
0
Cannot import name 'threadpool_info' from 'sklearn.utils.fixes'
#291
drunkwcodes
opened
5 days ago
0
Improved Table Parsing
#290
m9e
opened
1 week ago
0
Multiple Model Loading Errors
#289
vishaldwdi
opened
1 week ago
0
Program crashed it self,every time.
#288
seba9601
opened
1 week ago
0
Report an error: RuntimeWarning: invalid value encountered in cast aff_img = Image.fromarray((affinity_map * 255).astype(np.uint8))
#287
potatoo0000
opened
2 weeks ago
4
batch processing stuck
#286
jonabert
opened
2 weeks ago
0
pdf to quarto markdown
#285
horhenani
opened
2 weeks ago
0
hi i am thinking fine tune llm with images and md file
#284
Batuking1111
opened
3 weeks ago
0
Feature request: URL extraction
#283
ShakirAkbari
opened
3 weeks ago
1
Feature Request
#282
kssextro
opened
3 weeks ago
0
module 'torch.nn' has no attribute 'RMSNorm'
#281
robertio
opened
3 weeks ago
0
After converting the pdf is always occupied, windows, cuda
#280
clinton81
opened
3 weeks ago
0
IndexError: list index out of range (New)
#279
wooemans
opened
3 weeks ago
0
Update README: Revise Python version requirement to 3.10+
#278
jcytong
opened
3 weeks ago
2
Absolut Gobboly Goo Output Markdown
#277
lucaspeyrin
opened
4 weeks ago
1
Replace newlines with HTML line breaks in table cells
#276
conscienceli
opened
4 weeks ago
2
option to output markdown to stdout?
#275
daboe01
opened
1 month ago
0
Broken superscripts (references to bibliography items, footnotes, author affiliations, etc.)
#274
XZF0
opened
1 month ago
0
finetune issue
#273
Frank-Zeng
opened
1 month ago
0
marker识别公式能力是不是很差啊
#272
ghost
opened
1 month ago
0
Chang default output setting
#271
yibie
opened
1 month ago
5
Abrubt Termination (Without any error) on Google Colab, AWS EC2
#270
G999n
closed
1 month ago
4
Too much memory cost for big pdf 800 pages , cost 80GB ram.
#269
whp98
opened
1 month ago
2
ImportError: Failed to load PyTorch C extensions
#268
mctouch
opened
1 month ago
1
PermissionError: [Errno 13] Permission denied: 'C:\\Users\\schor\\AppData\\Local\\Temp\\tmpoaply4el.pdf'
#267
Schorakbi
opened
1 month ago
5
AttributeError: 'MBartOrderConfig' object has no attribute 'max_width'
#266
Pikacheng
opened
1 month ago
0
unsupported operand type(s) for |: '_GenericAlias' and 'NoneType'
#265
YuChuanhui3
opened
1 month ago
2
Update minimal required Python to 3.10
#264
rmast
opened
1 month ago
1
Table Detection and Parser
#263
TakshPanchal
closed
1 month ago
1
Improving OCR through higher image DPI?
#262
kkarski
opened
1 month ago
0
Integrate new OCR
#261
VikParuchuri
closed
1 month ago
0
Is there any way to enhance the bibliography?
#260
flight505
opened
1 month ago
0
Improve cold boot time
#259
frankbaele
closed
1 month ago
0
is it possible to update build with numpy < 2
#258
flight505
closed
1 month ago
1
fix: None env parse for OCR_ENGINE
#257
Zxilly
closed
1 month ago
2
OCR_ENGINE=None Doesn't work
#256
svmrw
opened
1 month ago
2
Cannot use MPS with torch multiprocessing share_memory
#255
swswsws583
closed
1 month ago
3
Help!settings is not Take effect
#254
caixiongjiang
closed
1 month ago
1
suggest to json.dumps with `ensure_ascii=False`
#253
Honglei-Cong
opened
1 month ago
0
AttributeError: 'NoneType' object has no attribute 'bboxes'
#252
lianyant
opened
1 month ago
1
upload image to s3 automatically
#251
liqiankun1111
opened
1 month ago
0
Issues about "OCR_ALL_PAGES" and "num_chunks"
#250
ckgithub2019
opened
1 month ago
0
IMPORTANT:Total elapsed time of loading model is 3.1 minutes, super slow, is it normal?
#249
ckgithub2019
opened
1 month ago
8
CONFUSING! About Optional: OCRMyPDF
#248
ckgithub2019
closed
1 month ago
3
ImportError: cannot import name 'segformer' from 'surya.model.detection' (unknown location)
#247
Beaverfffan
opened
1 month ago
1
subprocess.CalledProcessError: Command 'marker /data/mypdf/data/tmp /data/mypdf/tmp_pdf --workers 2 --max 2 ' died with <Signals.SIGSEGV: 11>.
#246
0x01111
opened
1 month ago
0
What's the lowest torch version of requirements? MUST upgrade to 2.4.0?
#245
ckgithub2019
opened
2 months ago
1
Next