VikParuchuri marker issues

VikParuchuri / marker

Convert PDF to markdown quickly with high accuracy

https://www.datalab.to

GNU General Public License v3.0

14.15k stars 720 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

The processing is still very slow. Is the entire program executed in parallel or serially?

#125 xuboot closed 1 month ago
2
Local Model Loading Issue

#124 wangyonghui3397 opened 1 month ago
0
AttributeError: 'NoneType' object has no attribute 'bboxes'

#123 Volkopat opened 1 month ago
4
Does marker have services and applications integrated with LLM large models?

#122 xuboot closed 1 month ago
1
V2 is a huge improvement

#121 Lastofthefirst opened 1 month ago
4
Cannot uninstall TBB

#120 RUANRui-ECON closed 3 weeks ago
7
AttributeError: 'PdfDocument' object has no attribute 'name'

#119 lvjg opened 1 month ago
1
I tested it and the speed is very slow. Can it be optimized? It seems that it cannot meet the online needs.

#118 xuboot closed 1 month ago
3
Missing config.yml ?

#117 jstjoe closed 1 month ago
3
Marker v2

#116 VikParuchuri closed 1 month ago
0
Automatic batch size using accelerate

#115 juletx closed 1 month ago
1
Code formatting, update batch sizes

#114 VikParuchuri closed 2 months ago
0
hello, I get an error when converting multiple pdf files in one directory to mkdown. How can I adjust the parameters and optimize them?

#113 xuboot closed 2 months ago
1
Add bold/italic formatting

#112 VikParuchuri closed 2 months ago
0
Add image extraction support

#111 VikParuchuri closed 2 months ago
0
Problem detecting columns / VRAM Requirements GPU acceleration

#110 tilllt closed 2 months ago
6
Fix issues

#109 VikParuchuri closed 2 months ago
0
Patch

#108 VikParuchuri closed 2 months ago
0
Very early commercial marker preview

#107 VikParuchuri closed 2 months ago
0
RuntimeError: Invalid buffer size 17.26 GB when using LayoutLMv3 model in PDF conversion script

#106 archit15singh closed 2 months ago
1
.title() function - alternative for other languages

#105 BrokenChip231 opened 2 months ago
0
Google Colab notebook?

#104 yachty66 opened 2 months ago
2
Found nonstandard filetype xml 1.0 document, ascii text, with very long lines (2528)

#103 SmallBlueWolf closed 2 months ago
0
Add build-docker-containers.sh script and Dockerfiles for CPU and GPU…

#102 robin-collins opened 3 months ago
1
Create docker-build.yml

#101 robin-collins closed 3 months ago
0
package for Windows?

#100 KarissaChan1 closed 2 months ago
3
Illegal hardware instruction

#99 amuricys closed 2 months ago
3
Is there any option to export image for PDF figures？

#98 Watterry closed 2 months ago
1
Can't install Ray

#97 amuricys closed 2 months ago
4
The output markdown file has duplicate content

#96 Xiaoyuan-xyz opened 3 months ago
0
convert.py crashes with CUDA error and hangs on CPU

#95 thawn opened 3 months ago
0
Lock file is out of date

#94 tekumara closed 2 months ago
3
💎 feat(release): Dockerfile

#93 mxchinegod opened 4 months ago
0
make count more efficient

#92 HubertY opened 4 months ago
0
Import error magic (mac install)

#91 gpillemermbww closed 2 months ago
2
How much GPU memory is required to train the layout segmenter model？

#90 codeants2012 closed 2 months ago
1
can you offer the train_data of layout segmenter model？

#89 codeants2012 closed 4 months ago
2
Using Surya along with "marker" to get a formatted md file as output

#88 trivikramak closed 4 months ago
1
has chinese version?

#87 codeants2012 closed 4 months ago
1
Segmenting Markdown-converted PDFs into pages

#86 umarbutler closed 3 weeks ago
7
Improved Windows installation instructions

#85 umarbutler closed 2 months ago
3
How to add option to marker page range of pdf

#84 BenEcon opened 5 months ago
1
Tesseract is run despite text layer being present

#83 lvsass closed 2 months ago
1
Issue with indexer

#82 akaler727 closed 2 months ago
4
Issue with installation requirements (pydantic_settings)

#81 JamMaster1999 closed 5 months ago
1
ValueError: could not convert string to float: 'True' in convert_single.py

#80 mrticker closed 2 months ago
3
ZeroDivisionError: float division by zero - File "/marker/marker/cleaners/code.py", line 111, in indent_blocks

#79 mrticker closed 2 months ago
2
Token indices sequence length is longer than the specified maximum sequence length for this model (395 > 384)

#78 mrticker closed 2 months ago
3
If a PDF contains #s, they will become headers in markdown

#77 mrticker closed 2 months ago
1
What would you recommend for converting latex to markdown?

#76 mrticker closed 2 months ago
1

Previous Next