-
### Description
Currently we have an invoice generation system which generates PDF by converting from HTML. I was looking it to migrate to typst for PDF generation.
Issue is our invoice is 8 colum…
-
### Describe your problem
Why not convert tables parsed from PDF and Word files into Markdown format? Is it because HTML format is better recognized by LLM?
Table Markdown format, I mean like th…
-
### What would you like to do?
Report an issue on quarto.org
### Description
https://quarto.org/docs/reference/formats/pdf.html#tables
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
Hi,
After running the command
```
$ nougat.exe .\pharmaceutics-16-00226.pdf -o .\output -m 0.1.0-base
```
I get successful downloads, but then an error.
I get this same error on Windows and L…
-
Try Tabula - http://tabula.technology/
znmeb updated
8 years ago
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
my pdfs has both text and tables
i need to extract both seperately to maintain data qu…
-
**Describe the bug**
NotImplementedError: File format not supported
I am facing this error where the expected result is the list of tables
However I am getting a Implementation exception instead
…
-
Attach (recommended) or Link to PDF file here:
Configuration:
- Web browser and its version: Firefox 110.0
- Operating system and its version: Manjaro 22.0
- PDF.js version: 3.3.56
- Is a brows…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
How to handle complex PDFs,such as PDFs with images, tables, etc.