-
Is there a way to convert the parsing results to CSV and the CSV contains all the elements or objects like text,table,image path etc
I am asking similar to unstructured I/O method or function
…
-
Lots of people want to use to scrape pdf links which we already support by default but looking at some of the logs and comments on the docs we still run into issues. Would be good to have more reliabl…
-
I am using LlamaParse via the REST API: https://docs.cloud.llamaindex.ai/llamaparse/getting_started/api
I have PDF I am trying to parse: https://s3.us-west-2.amazonaws.com/docs.scbbs.com/docs/test/…
-
We need to parse the PDF report to a machine-readable format to facilitate the Python script reading the file. Here are some existing tools:
- To JSON:
- [ ] [unstructured](https://github.com/Unstru…
-
Hey,
I'm having a hard time parsing pdf files with two vertical columns filled with text. It actually sometimes captures the right order, but often does not. I'm parsing it into markdown.
For ex…
-
Hi
Thanks for the great tool!
I was recently trying to use it with GPT4o but I keep getting this error -
```Error while parsing the file '/NS/llm-1/work/afkhan/Work/Paper.pdf': Failed to parse t…
-
**Describe the Issue**
I deployed the V1 version on a VPS server, and it started running well at first, but after a while, the system resources would be overloaded, causing the firecrawl worker to st…
-
Currently, we scrape sources based on a predefined frequency. Adding new sources requires writing specific code for each one. While [Scrapy](https://github.com/bitcoinsearch/scraper?tab=readme-ov-file…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
I used OllamaEmbedding llama3:8b for local testing. After completing the index building …
-
When I try to run the multimodal_report_generation_agent.ipynb from the example locally, it reports an error in the third code block:
import llama_index.core
import os
PHOENIX_API_KEY = "xx…