-
If the knowledge base consists of image materials or PDF files containing image information, is it currently not supported? Will there be support for OCR recognition technology for images in the futu…
-
The content before and after the table in the pdf file cannot be processed normally, and the content of the table and the content of the article will be confused.
![pdf-talbe](https://github.com/ad…
-
I am new to LlamaIndex and LlamaParse
I am using below code to parse PDF file:
def main():
load_dotenv()
nest_asyncio.apply()
LLAMA_CLOUD_API_KEY = os.getenv("LLAMA_CLOUD_API_KEY")
…
-
Show a progress bar for the pdf parsing
-
Introduce a new library type called Magazines aimed for Magazines. The library behaves in the following ways:
- [x] Dedicated parser with a limited set of naming conventions
- [x] PDFs will open w…
-
😭 does extracting images not really work that well?
```
# Uncomment if you are in a Jupyter Notebook
import nest_asyncio
nest_asyncio.apply()
from llama_parse import LlamaParse # pip install…
-
e.g. https://github.com/facebookresearch/nougat
https://github.com/microsoft/table-transformer
-
`[development] irb(main):035:0> pdf
-
### Context
Hi, I'm parsing a PDF using version 0.4.4 of llama-parse and using gpt-4o:
```python
max_timeout = 6000
num_workers = 4
check_interval = 10
parser = LlamaParse(
result_type=…
-
Hi,
I have this wierd error:
And I am getting this result by `x =File.open('~/billapp.pdf', 'rb')`
I am adding that PDF here
[billapp.pdf](https://github.com/yob/pdf-reader/files/970461/…