-
- PHP Version: 8.3.1
- PDFParser Version: 2.10
### Description:
I have a multi-page PDF (about 90 pages). All pages contain a table of similar data, and all parse with _getDataTm_ without is…
-
### Initial Checks
- [X] I confirm that I'm on the latest version
### Description
[example1.pdf](https://github.com/user-attachments/files/16424947/example1.pdf)
[example2.pdf](https://github.…
-
Dear friends, good day.
I've installed your excellent library, and it works fine with the sample:
But I've run into one problem: I have a table in the parsed PDF, and I would be grateful if you could give me sample code to …
-
2024-09-14 08:43:13.607 | WARNING | easyofd.parser_ofd::24 - FONT registerFont failed COURI.TTF: Courier New
2024-09-14 08:43:13.607 | WARNING | easyofd.parser_ofd::24 - FONT registerFont failed …
-
I have a clear PDF with proper images, but this gives
from unstructured.partition.pdf import partition_pdf
from PIL import UnidentifiedImageError
# Extract images, tables, and chunk text
…
-
When trying to parse this PDF _rose_production_split_pages.pdf_ (file was removed), we're getting an error:
```
RangeError:
index out of range
# /Users/laykou/.rvm/gems/ruby-3.1.0/gem…
-
**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
During inference, the current approach is to first…
-
File upload currently only allows text or PDF files; we could use our Tika parser to enable other upload types.
As conversion would be done on the server, this would require adding a simple entrypoint to c…
-
I am encountering an issue with detecting text encoding from PDF files. While the encoding detection works correctly for .txt files, it consistently returns None for PDF files.
Steps to Reproduce:
…
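
One possible explanation, offered as an assumption because the report does not name the detection library: a PDF is a binary container rather than a plain-text file, so a byte-level encoding detector may have nothing text-like to inspect. The sketch below uses only the Python standard library; `looks_like_pdf` and `guess_text_encoding` are hypothetical helpers written for illustration, not part of any library mentioned here.

```python
from typing import Optional, Sequence

# Every PDF file begins with this header marker, followed by a version number.
PDF_MAGIC = b"%PDF-"


def looks_like_pdf(data: bytes) -> bool:
    """Return True if the bytes start with the PDF header marker."""
    return data.startswith(PDF_MAGIC)


def guess_text_encoding(
    data: bytes, candidates: Sequence[str] = ("utf-8", "latin-1")
) -> Optional[str]:
    """Hypothetical helper: return the first candidate that decodes the bytes.

    Returns None for PDF input, because the raw bytes of a PDF are a
    container format and not text in a single encoding.
    """
    if looks_like_pdf(data):
        return None
    for name in candidates:
        try:
            data.decode(name)
        except UnicodeDecodeError:
            continue  # this candidate cannot decode the bytes; try the next one
        return name
    return None
```

Under this assumption, the text would first have to be extracted from the PDF with a PDF parser; encoding detection on the raw file bytes is only meaningful for plain-text input such as .txt files.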
-
1) I was using Llama Parse cloud to read the content from a scanned image in a PDF. Llama Parse was able to decode the text from the scanned image.
2) But starting today I see that Llama Parse …