issues
search
DS4SD
/
docling
🥚 Transform PDF to JSON or Markdown with ease and speed 🐣
MIT License
197
stars
18
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
chore: fix placeholders in license
#63
dolfim-ibm
closed
1 day ago
0
'NoneType' object has no attribute 'get_width'
#62
golivschultz1996
opened
4 days ago
1
Add CLI
#60
vagenas
opened
5 days ago
1
docs: Update MAINTAINERS.md
#59
cau-git
closed
6 days ago
0
docs: Mention quackling on README
#58
cau-git
closed
6 days ago
0
fix: propagate row_section in tables
#57
dolfim-ibm
closed
1 week ago
0
docs: add instructions for cpu-only installation
#56
dolfim-ibm
closed
1 week ago
0
Some PDFs cause docling-parse to fail with assertion errors
#55
cau-git
closed
6 days ago
1
feat: export document pages as multimodal output
#54
dolfim-ibm
closed
5 days ago
0
fix: table cells overlap and model warnings
#53
dolfim-ibm
closed
1 week ago
0
fix: refine conversion result
#52
vagenas
closed
1 week ago
0
fix: Add unit tests
#51
PeterStaar-IBM
closed
1 week ago
0
docs: update interface in README
#50
dolfim-ibm
closed
1 week ago
0
fix: align output formats
#49
dolfim-ibm
closed
1 week ago
0
Where can I find official docs for the package?
#48
aeamaea
opened
2 weeks ago
4
feat: Page-level error reporting from PDF backend
#47
cau-git
closed
2 weeks ago
0
fix: Better raise exception when a page fails to parse
#46
cau-git
closed
2 weeks ago
0
fix: Upgrade docling-parse to 1.1.1, safety checks for failed parse on pages
#45
cau-git
closed
2 weeks ago
0
feat: Upgrade docling-parse PDF backend and interface to use page-by-page parsing
#44
cau-git
closed
2 weeks ago
0
fix: usage of bytesio with docling-parse
#43
dolfim-ibm
closed
2 weeks ago
0
fix: remove [ocr] extra to fix wheel install
#42
dolfim-ibm
closed
2 weeks ago
0
fix: Add scipy as dependency
#40
cau-git
closed
2 weeks ago
0
fix: Update docling-ibm-models to v1.1.2
#39
cau-git
closed
2 weeks ago
0
feat: Add adaptive OCR, factor out treatment of OCR areas and cell filtering
#38
cau-git
closed
2 weeks ago
0
docs: add technical paper ref
#37
dolfim-ibm
closed
2 weeks ago
0
feat: allow computing page images on-demand with scale and cache them
#36
dolfim-ibm
closed
2 weeks ago
0
chore: Add redbooks to test data, small additions
#35
cau-git
closed
2 weeks ago
0
fix: allow newer torch versions
#34
dolfim-ibm
closed
3 weeks ago
0
fix: Re-map layout class for table of contents
#33
cau-git
closed
3 weeks ago
0
feat: update parser with bytesio interface
#32
dolfim-ibm
closed
3 weeks ago
0
feat: output page images and extracted bbox
#31
dolfim-ibm
closed
3 weeks ago
0
Implement Lazy OCR option
#30
maxmnemonic
closed
2 weeks ago
1
fix: update vuln deps
#29
dolfim-ibm
closed
1 month ago
0
fix: constructor typings
#28
dolfim-ibm
closed
1 month ago
0
docs: improve examples
#27
dolfim-ibm
closed
1 month ago
0
feat: introducing docling_parse_backend
#26
maxmnemonic
closed
1 month ago
0
Add scale as an optional parameter for get_text_in_rect call
#25
maxmnemonic
opened
1 month ago
0
Make simple example that uses OCR
#24
maxmnemonic
closed
2 weeks ago
1
Update name of the parameter "table_structure_options.do_cell_matching" to better reflect meaning
#23
maxmnemonic
opened
1 month ago
0
fix: set page number using 1-based indexing
#22
vagenas
closed
1 month ago
0
Fixes for correct text extraction for table cells
#21
maxmnemonic
closed
1 month ago
0
feat: add simplified single-doc conversion
#20
vagenas
closed
1 month ago
0
fix: add easyocr to main deps for valid extra
#19
dolfim-ibm
closed
1 month ago
0
fix: expose ocr as extra
#18
dolfim-ibm
closed
1 month ago
0
pypdfium2: just forward input to PdfDocument directly
#17
mara004
closed
1 month ago
1
feat!: v1.0.0 release
#16
dolfim-ibm
closed
1 month ago
0
feat!: v1.0.0 release
#15
dolfim-ibm
closed
1 month ago
0
chore: switch to native Markdown export
#14
vagenas
closed
1 month ago
0
chore: update README
#13
vagenas
closed
1 month ago
0
fix: missing type for default values
#12
dolfim-ibm
closed
1 month ago
0
Next