-
**Describe the bug**
With the addition of context-aware chunking, the --model parameter in data generate is used in two competing places, which makes it impossible to interact with a remote teacher …
-
### Feature Name
swarmauri_community/parsers/concrete/TabulaPDFParser.py
### Feature Description
Using Tabula, extract tables from PDF files
### Motivation
To enable parsing of PDF documents
###…
-
### Description of the bug
Recently, among the documents to process, I received a document scanned with a Lexmark machine that becomes blank after saving with the “clean” option set to True. This behavior …
-
We're using the lib for validation and PDF rendering, and we have a bunch of demo documents: ZUGFeRD 1 and 2 as PDF and XRechnung as XML. All docs work with 2.13, but many of them now have problems …
-
As a backend member,
I want to conduct the first POC of a full basic LLM workflow so that I can integrate it into the base backend.
## The flow
- Read and parse PDF document
- Chunk documents and m…
-
Let's take as an example: https://www.stateninformatie.provincie-utrecht.nl/api/v1/meetings/8992/documents/23264
In the PDF, we find a multi-column style like this:
![image](https://user-images.…
-
**Is your feature request related to a problem? Please describe.**
Sometimes existing PDF pages need to be embedded into QuestPDF-generated PDFs.
**Describe the solution you'd like**
A Visual Ele…
-
WCAG requirements mention that PDF documents are not accessible and that an open document format (e.g. HTML) should be available for users with accessibility needs.
We need to understand how this impacts…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
Need to be able to upload "forms" as text or Word documents, scanned images, PDF docs, JSON files, JPEG/PNG images, MP4 and other video clips, and audio clips for "Q/A, summarization", etc. with OPEA R…