-
### What?
Similar to Cursor AI's feature, we can add the ability for the user to be able to add links to any websites and allowing the LLM to reference them during the conversation.
### How?
1. W…
-
### Is there an existing issue for this?
- [x] I have searched the existing issues
### Describe the bug
I am encountering a strange problem.
In our application I have a single controller where PUT…
-
In OpenSearch 2.13, we released [**text chunking processor**](https://opensearch.org/docs/latest/ingest-pipelines/processors/text-chunking/). This processor enables users to chunk documents to avoid i…
-
### What happened?
I'm creating an API with Flask. The other side will send me a file and I will save it to chroma database on my side. Chroma.add will terminates my program without any exception. Wh…
-
I am currently partitioning a docx file harnessing unstructured with the next input params:
```json
{
"filename": "document.docx",
"response_type": "application/json",
"coordinates": fals…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
@dosu iam using LlamaParseJsonNodeParser, for parsing the documents,iam using 8196 conte…
-
This write-up was prompted by [this zulip topic](https://neuroinformatics.zulipchat.com/#narrow/channel/406001-Movement/topic/Output.20formats).
## The problem
We have so far taken a pluralistic…
-
Is there a feature in the pipeline to support web scraping functionality - similar to what the LangChain library has to offer (https://python.langchain.com/v0.1/docs/use_cases/web_scraping/).
It is…
-
```
[](https://localhost:8080/#) in extract_data_from_pdf(pdf_path)
57 # Function to extract text using the unstructured library
58 def extract_data_from_pdf(pdf_path):
---> 59 eleme…
-
I Have the Clear pdf with proper images but this give
from unstructured.partition.pdf import partition_pdf
from PIL import UnidentifiedImageError
# Extract images, tables, and chunk text
…