-
# Background
The E2E tests:
1. Do a complete submission
2. Use the API to validate that Tribal audits are correctly suppressed
This works because the API points at the `dissemination` tables…
-
Given this code:
```
import openparse
basic_doc_path = "mydoc.pdf"
parser = openparse.DocumentParser(
    table_args={
        "parsing_algorithm": "unitable",
        "min_table_confidence":…
-
Add UI for the permit summary page and the permit PDF viewer page (see screenshots from Figma). @wdwiii, please focus only on building out the UI for this section.
- [x] Add Permit summary page (for now you …
-
The attached input document contains text, then a table, followed by more text. We want the text file to match the input PDF file.
![input_page](https://github.com/user-attachments/assets/fe…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
My PDFs have both text and tables.
I need to extract both separately to maintain data qu…
-
**Feature request**
We are using pdfminer.six to convert tables in PDFs (using both lattice and stream) and found that Thai characters have some issues: it cannot detect the proper fonts even the fo…
-
During the face-to-face meeting ([reference](https://github.com/wmo-im/CCT/wiki/20.to.22.September.2023)), the team decided to publish GRIB and BUFR codes to the WMO Codes Registry after every approved…
-
I want to crop all the figures/images/tables in one PDF. Can I get the page number of each figure in `doc.figures[x]`?
-
See: http://www.princexml.com/doc/rotating/#rotating
-
### Describe the bug
**context**
I am trying to generate a PDF of my Sphinx documentation using the xelatex engine. The Markdown file I am using contains many tables.
**expectation**
…