-
Hi,
Thank you for this excellent repository! I appreciate how you organize and present the papers; it’s very neat.
I am curious whether the model proposed in the "Unifying Vision, Text, and Layo…
-
This project uses RapidOCR for image OCR and Fitz in the PyMuPDF package for PDF OCR. To be honest, it is extremely difficult to recognize tables in some PDFs, especially in scholarly papers. Therefor…
-
We updated: enabled OCR and changed Top k to 40. We used the "Generative AI and the Nature of Work" paper and it still hallucinated 3 quotes.
This ticket is to have a conversation between D3 and AM.
…
-
EDIT: [a related OCR/NLP avenue](https://doi.org/10.48550/arXiv.2302.14045)
-
I'm fired up about a rust implemented document parsing / embedding engine for my code and documents. Sadly, I don't see a good PDF ingestion in the code.
Ideally, I'd like to import PDFs from acad…
-
Does Grobid do OCR I am trying to get Grobid to process older PDF's like :-
https://www.cs.cmu.edu/~crary/819-f09/Reynolds74.pdf
https://chomsky.info/wp-content/uploads/195609-.pdf
http://somr.…
-
## Error screenshot
## Code snipp
```python
from pix2text import Pix2Text
img_fp = 'paper/2402.11867v3.pdf'
p2t = Pix2Text.from_config()
doc = p2t.recognize_pdf(img_fp)
doc.to_mar…
-
It's stuck at the embedding stage. I've tried a solution to #7 but it did not help.
The output file such as `original_filename (ocr).pdf` is corrupted and can't be opened. Please see the debug below.…
-
**Short Description**
**MAXIM: Multi-Axis MLP for Image Processing**: I think it's a follow-up work of MaxViT from google. It shows a great performance on the following low-vision task, i.e. for OC…
innat updated
1 month ago
-
Hi, I run the code but there are some error happened.
When i run the inference.py, there is no mistake, and get the correct result.
but when i run the inerence1.py, there are some mistake, the messa…