issues
search
dantetemplar
/
pdf-extraction-agenda
Overview of pipelines related to PDF to Markdown document processing.
MIT License
70
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Formatting improvements
#26
alexander-zuev
opened
1 week ago
6
Amazon Textract​ Pipeline
#25
dantetemplar
opened
2 months ago
1
Azure OCR Pipeline
#24
dantetemplar
opened
2 months ago
1
Google Document AI Pipeline
#23
dantetemplar
opened
2 months ago
1
Upstage AI Pipeline
#22
dantetemplar
opened
2 months ago
1
SmolDocling Pipiline
#21
dantetemplar
opened
2 months ago
1
MistralOCR Pipeline
#20
dantetemplar
opened
3 months ago
1
Vision Parse Pipeline
#19
dantetemplar
opened
3 months ago
1
Markdrop Pipeline
#18
dantetemplar
opened
3 months ago
1
Reddit posts related to PDF2MD
#17
dantetemplar
opened
3 months ago
0
Extractous Pipeline
#16
dantetemplar
opened
3 months ago
1
Open-Parse Pipeline
#15
dantetemplar
opened
3 months ago
1
Pix2Text Pipeline
#14
dantetemplar
opened
3 months ago
1
Unstructured Pipeline
#13
dantetemplar
opened
3 months ago
1
Zerox Pipeline
#12
dantetemplar
opened
3 months ago
1
olmoOCR Pipeline
#10
dantetemplar
opened
3 months ago
1
markitdown Pipeline
#9
dantetemplar
opened
3 months ago
1
Marker Pipeline
#8
dantetemplar
opened
3 months ago
1
MinerU Pipeline
#7
dantetemplar
opened
3 months ago
1
LlamaParse Pipeline
#6
dantetemplar
opened
3 months ago
1
Mathpix Pipeline
#5
dantetemplar
opened
3 months ago
1
Nougat Pipeline
#4
dantetemplar
opened
3 months ago
1
GOT-OCR Pipeline
#3
dantetemplar
opened
3 months ago
1
DocLing Pipeline
#2
dantetemplar
opened
3 months ago
1
Test
#1
dantetemplar
closed
3 months ago
4