-
## Prework
- [x] Read and agree to the [code of conduct](https://www.contributor-covenant.org/version/2/1/code_of_conduct/) and [contributing guidelines](https://github.com/posit-dev/great_tables/b…
-
Certain types of reports that only contain a few columns have inconsistent sizing in the PDF report. This results in both inconsistently sized tables and charts in the PDF reports, sometimes within th…
-
Hi,
PDF files are converted to DOCX and then tables are extracted from DOCX.
There are hidden columns and hidden text in the tables.
Is there a way to ignore the hidden columns and text during co…
-
I was directed to this repo in order to make a request for an nth fragment selector.
The problem I am trying to solve is to be able to select and update the CSS of tables split across PDF pages. Fo…
-
Given this code:
```
import openparse
basic_doc_path = "mydoc.pdf"
parser = openparse.DocumentParser(
table_args={
"parsing_algorithm": "unitable",
"min_table_confidence":…
-
Hello,
This concept allows to attach one to many documents (photos, pdf, ..) to every database classes.
It does not currently exist in the TDH data model, however it does exist in the TWW and TWA da…
-
I am exporting a large PDF to tables then exporting them to csv but I am getting multiple pages. So if the PDF is 1000 pages long, the output expected is 1000 single csv -- one for each page. The or…
dml5 updated
2 months ago
-
From @petervwyatt :
Like programming language features, PDF has several different methods for embedding a file. Let me skim the FOP code and see if I can understand what option(s) they support - an…
-
### Description
bad tables. They are not consist with pdf.
### (Optional:) Please add any files, screenshots, or other information here.
_No response_
### (Required) What is this issue most closel…
-
See: http://www.princexml.com/doc/rotating/#rotating