-
### question
When I use the `WordFormatOption()` function to define a converter, I cannot get the picture info. Looking at the code of class `SimplePipeline`, I found that there is no property to set g…
-
Table detection and table formatting are working wonderfully for targeting Markdown; however, the PyPDFium2 formatting of non-tables is quite lacking. Unfortunately, I need to move away from pymupd…
-
Air permits are stored in the airbranch `dbo.APBPERMITS` table as varbinary data. Word documents are stored in the `DOCPERMITDATA` column, and PDF documents are stored in the `PDFPERMITDATA` column.
…
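For illustration, here is a minimal sketch of how a stored blob could be classified once it has been read from either column. It only checks well-known file signatures; the helper names (`guess_extension`, `pick_permit_file`), the `.bin` fallback, and the choice to prefer the PDF when both columns are populated are assumptions, not something the schema prescribes. Note that the ZIP signature matches any ZIP container, so treating it as `.docx` is also an assumption.

```python
from typing import Optional, Tuple

# Well-known file signatures (magic bytes).
PDF_MAGIC = b"%PDF"
ZIP_MAGIC = b"PK\x03\x04"                        # .docx is a ZIP container
OLE_MAGIC = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1"  # legacy .doc (OLE2 compound file)


def guess_extension(blob: bytes) -> str:
    """Guess a file extension from the leading bytes of a stored document."""
    if blob.startswith(PDF_MAGIC):
        return ".pdf"
    if blob.startswith(ZIP_MAGIC):
        return ".docx"  # assumption: a ZIP in this column is a Word document
    if blob.startswith(OLE_MAGIC):
        return ".doc"
    return ".bin"       # assumption: unknown content gets a generic extension


def pick_permit_file(
    doc_data: Optional[bytes], pdf_data: Optional[bytes]
) -> Optional[Tuple[str, bytes]]:
    """Hypothetical helper: takes the DOCPERMITDATA and PDFPERMITDATA values
    of one row and returns (extension, bytes), or None if both are empty."""
    # Assumption: prefer the PDF when both columns are populated.
    for blob in (pdf_data, doc_data):
        if blob:
            data = bytes(blob)
            return guess_extension(data), data
    return None
```

The function takes plain `bytes`, so it does not depend on any particular database driver; the values would come from whatever query reads the two columns.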
-
In the table in this document https://datadryad.org/docs/HumanSubjectsData.pdf
Please update the table text under 'Direct identifiers (none allowed)' such that
"Facial photograph or comparable …
-
Hey, thanks for the awesome doc toolkit.
I tried to run `pdf_path = "tests/test_files/direct_extract/single_column.pdf"`
and got the following error:
```
2024-11-02 17:47:58,569 - rapid_layout - INF…
```
-
The resulting PDF does have a lot of useful links that allow the reader to move about the document. However, a table of contents, where the individual chapters are accessible from the sidebar of many p…
-
### Requested feature
We have much finer-grained bbox information using docling-parse-v2, which could easily be leveraged by the layout and table models for improved accuracy.
-
### Bug
* Out-of-order conversion: it would be nice if the headers (UNCLASSIFIED, American Football Conference (AFC), AFC East) appeared after the text fields
* Keys missing their values: the text fields hav…
-
Hello, thanks for the great work done here; relying on `pandoc` is a choice I truly value :ok_hand:!
I am trying to render a Markdown table containing emoji with the following configuration but doe…
-
I'm encountering an issue with the tagpdf package when using the sidewaystable environment. In my case, the tag order for table content follows the LaTeX input sequence rather than reflecting the tabl…