VikParuchuri
/
marker
Convert PDF to markdown quickly with high accuracy
https://www.datalab.to
GNU General Public License v3.0
14.15k
stars
720
forks
issues
The processing is still very slow. Is the entire program executed in parallel or serially?
#125
xuboot
closed
1 month ago
2
Local Model Loading Issue
#124
wangyonghui3397
opened
1 month ago
0
AttributeError: 'NoneType' object has no attribute 'bboxes'
#123
Volkopat
opened
1 month ago
4
Does marker have services and applications integrated with LLM large models?
#122
xuboot
closed
1 month ago
1
V2 is a huge improvement
#121
Lastofthefirst
opened
1 month ago
4
Cannot uninstall TBB
#120
RUANRui-ECON
closed
3 weeks ago
7
AttributeError: 'PdfDocument' object has no attribute 'name'
#119
lvjg
opened
1 month ago
1
I tested it and the speed is very slow. Can it be optimized? It seems that it cannot meet the online needs.
#118
xuboot
closed
1 month ago
3
Missing config.yml ?
#117
jstjoe
closed
1 month ago
3
Marker v2
#116
VikParuchuri
closed
1 month ago
0
Automatic batch size using accelerate
#115
juletx
closed
1 month ago
1
Code formatting, update batch sizes
#114
VikParuchuri
closed
2 months ago
0
Hello, I get an error when converting multiple PDF files in one directory to markdown. How can I adjust the parameters and optimize them?
#113
xuboot
closed
2 months ago
1
Add bold/italic formatting
#112
VikParuchuri
closed
2 months ago
0
Add image extraction support
#111
VikParuchuri
closed
2 months ago
0
Problem detecting columns / VRAM Requirements GPU acceleration
#110
tilllt
closed
2 months ago
6
Fix issues
#109
VikParuchuri
closed
2 months ago
0
Patch
#108
VikParuchuri
closed
2 months ago
0
Very early commercial marker preview
#107
VikParuchuri
closed
2 months ago
0
RuntimeError: Invalid buffer size 17.26 GB when using LayoutLMv3 model in PDF conversion script
#106
archit15singh
closed
2 months ago
1
.title() function - alternative for other languages
#105
BrokenChip231
opened
2 months ago
0
Google Colab notebook?
#104
yachty66
opened
2 months ago
2
Found nonstandard filetype xml 1.0 document, ascii text, with very long lines (2528)
#103
SmallBlueWolf
closed
2 months ago
0
Add build-docker-containers.sh script and Dockerfiles for CPU and GPU…
#102
robin-collins
opened
3 months ago
1
Create docker-build.yml
#101
robin-collins
closed
3 months ago
0
package for Windows?
#100
KarissaChan1
closed
2 months ago
3
Illegal hardware instruction
#99
amuricys
closed
2 months ago
3
Is there any option to export image for PDF figures?
#98
Watterry
closed
2 months ago
1
Can't install Ray
#97
amuricys
closed
2 months ago
4
The output markdown file has duplicate content
#96
Xiaoyuan-xyz
opened
3 months ago
0
convert.py crashes with CUDA error and hangs on CPU
#95
thawn
opened
3 months ago
0
Lock file is out of date
#94
tekumara
closed
2 months ago
3
💎 feat(release): Dockerfile
#93
mxchinegod
opened
4 months ago
0
make count more efficient
#92
HubertY
opened
4 months ago
0
Import error magic (mac install)
#91
gpillemermbww
closed
2 months ago
2
How much GPU memory is required to train the layout segmenter model?
#90
codeants2012
closed
2 months ago
1
can you offer the train_data of layout segmenter model?
#89
codeants2012
closed
4 months ago
2
Using Surya along with "marker" to get a formatted md file as output
#88
trivikramak
closed
4 months ago
1
has chinese version?
#87
codeants2012
closed
4 months ago
1
Segmenting Markdown-converted PDFs into pages
#86
umarbutler
closed
3 weeks ago
7
Improved Windows installation instructions
#85
umarbutler
closed
2 months ago
3
How to add option to marker page range of pdf
#84
BenEcon
opened
5 months ago
1
Tesseract is run despite text layer being present
#83
lvsass
closed
2 months ago
1
Issue with indexer
#82
akaler727
closed
2 months ago
4
Issue with installation requirements (pydantic_settings)
#81
JamMaster1999
closed
5 months ago
1
ValueError: could not convert string to float: 'True' in convert_single.py
#80
mrticker
closed
2 months ago
3
ZeroDivisionError: float division by zero - File "/marker/marker/cleaners/code.py", line 111, in indent_blocks
#79
mrticker
closed
2 months ago
2
Token indices sequence length is longer than the specified maximum sequence length for this model (395 > 384)
#78
mrticker
closed
2 months ago
3
If a PDF contains #s, they will become headers in markdown
#77
mrticker
closed
2 months ago
1
What would you recommend for converting latex to markdown?
#76
mrticker
closed
2 months ago
1