issues
search
DS4SD
/
docling
Get your documents ready for gen AI
https://ds4sd.github.io/docling
MIT License
10.44k
stars
504
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
feat(ocr): added support for RapidOCR engine
#415
Swaymaw
opened
52 minutes ago
1
Title differenciation
#412
jedg75
closed
6 hours ago
0
Is it possible to fine tune with our own datasets?
#411
ninedesu
closed
3 hours ago
1
Using .DOCX format in cloud - suggestion on the below error?
#410
acsankar
opened
7 hours ago
2
chore: update the README
#409
PeterStaar-IBM
closed
16 hours ago
3
docs: add DocETL, Kotaemon, spaCy integrations; minor docs improvements
#408
vagenas
closed
16 hours ago
1
fix: force pydantic < 2.10.0
#407
dolfim-ibm
closed
1 hour ago
2
I get an error trying to export figures
#406
olsihoxha
closed
17 hours ago
2
Support Image path/url
#405
ezscode
opened
20 hours ago
0
Which type of Markdown is supported?
#404
thomasfrederikhoeck
opened
21 hours ago
1
Graphical user interface for parsed JSON?
#403
Upabjojr
closed
3 hours ago
1
chore: add downloads in README, security policy and update ci actions
#401
dolfim-ibm
closed
19 hours ago
1
Document normalization: warning on `checkbox-unselected`
#399
pierre-sigwalt
opened
1 day ago
1
analyzing the pdf is too slow
#398
langzichai
closed
22 hours ago
2
fix: python3.9 support
#396
dolfim-ibm
closed
1 day ago
1
Can support for widgets in Dify be considered?
#394
zswll2
closed
22 hours ago
1
feat(ocr): added support for PaddleOCR engine
#393
Swaymaw
opened
1 day ago
2
feat(ocr): Integrating PaddleOCR in Docling
#392
Swaymaw
closed
1 day ago
1
Docx cannot get pic info
#391
Zhengyu-Ju
opened
1 day ago
6
fix: propagate document limits to converter
#388
dolfim-ibm
closed
2 days ago
1
do we have a function to generate a folder which contains images folder and markdown file
#387
Zhengyu-Ju
opened
2 days ago
6
Python 3.9 Support?
#385
davidmezzetti
closed
1 day ago
3
Advanced chunking example
#384
vagenas
opened
2 days ago
1
Loading a pdf results in a StopIteration error
#383
charlescearl
opened
2 days ago
5
Table representation misaligned between PDF and DOCX
#382
vagenas
opened
2 days ago
2
feat: added support for exporting DocItem to an image when page image is available
#379
sh-gupta
closed
2 days ago
1
docs: fixed typo in v2 example v2
#378
gaspardpetit
closed
2 days ago
1
feat: expose ocr-lang in CLI
#375
dolfim-ibm
closed
2 days ago
1
chore: update dependencies
#374
vagenas
closed
2 days ago
2
Newcomers who want to start source code, how should I do it?
#372
aodingpeng
closed
2 days ago
1
chore: update lock of deps
#371
dolfim-ibm
closed
2 days ago
1
Bug
#370
patle22cute
closed
2 days ago
5
Add Parallelization Support to `convert_all()` Function with `num_worker` Parameter
#369
naufalso
opened
3 days ago
2
How to give HTML code as a string
#368
jaswanth-13
closed
3 days ago
1
Support for HOCR?
#366
4F2E4A2E
closed
2 days ago
2
LXML versions greater or equal than 5.0.0 are not allowed
#363
danitico
closed
3 days ago
4
Should the second "if" keyword in adapt_bbox from layout_utils.py rather be an "elif" keyword ?
#362
Raphilanthrope
opened
3 days ago
1
cannot import name 'TextPipelineOptions' from 'docling.datamodel.pipeline_options'
#360
adrianzhang
closed
3 days ago
1
export_to_markdown page separator
#359
GermeauSimon
closed
3 days ago
1
Docling <page_assemble_model> reading order algorithm
#358
mllife
closed
3 days ago
1
docling identified my entire page as a picture
#357
aodingpeng
opened
4 days ago
4
How do I use the downloaded ds4sd/docling-models?
#353
Runningwater2357
closed
6 days ago
3
How to ignore equation ?
#352
kh4n9373
closed
6 days ago
1
Syntax error while parsing object key (pdf with Chinese characters)
#351
danielkorzekwa
closed
4 days ago
1
ci: fix mergify
#350
dolfim-ibm
closed
6 days ago
1
feat: Extracting picture data for raster images found in PPTX
#349
maxmnemonic
closed
3 days ago
2
What are doctags
#348
pwright
closed
6 days ago
0
Analyzing PDf files is too slow
#346
langzichai
closed
3 days ago
1
Add LaTex and mathpix-markdown-it as outputs
#343
sirus20x6
opened
1 week ago
2
Add Markdown-based table serialization in chunking
#342
vagenas
opened
1 week ago
0
Next