-
While evaluating whether llama_parse would work for our use case, I noticed that llama_index appeared to ignore a large portion of the text in the test document I used.
When I opened said document …
-
### Operating system
Android
### Joplin version
3.0.8
### Desktop version info
_No response_
### Current behaviour
1. Open a specific note.
2. Press the three-dot menu at the top right.
3. Choose Share.
4. …
-
TXT files can be uploaded to paperless-ng via the web frontend and will be consumed.
1. These files won't be converted to PDFs.
2. If you want to edit the document in paperless, it is not shown in …
-
To compare different pipelines (LLMs, pdf2img, pdf2txt) we need a benchmark.
## 1. Choose a subset of datasheets from each manufacturer
* consider special PDFs that need OCR
* scrambled text
#…
-
I loaded an .odt text file that contains a table and several text frames. When I save it as .odt, everything looks fine, but when I save it as PDF, all elements except the table are missing.
` Fil…
-
### Bug Description
Hey, so to sum it up: I create a SimpleDirectoryReader with a PDFReader as an extractor, an S3 bucket as the input_dir, and S3 as the fs.
Then I call load_data() which leads…
-
Submit a PDF/Word/Text file on HelloIITK, including a link to your GitHub repository and the hosted game.
Ensure it includes information on where to find code quality reports, test coverage reports, …
-
When printing to PDF, the text in the resulting PDF can no longer be copied.
-
Hello.
I would like your help with this.
I need to search inside several pdfs for a specific text.
Is there a way to get text from a pdf opened with chrome?
Or is there a way to perform search in…
-
In ssimms/pdfapi2/issues/86, @muehlenp reports:
If the input PDF file is read-only the output PDF is corrupt. E.g. KDE's okular shows "Some errors were found in the document, Okular might not be ab…