ocr-papers Search Results

SongWWWWWW/LangChain-chatchat-hitwh #1

OCR transition

This project uses RapidOCR for image OCR and Fitz in the PyMuPDF package for PDF OCR. To be honest, it is extremely difficult to recognize tables in some PDFs, especially in scholarly papers. Therefor…

shadow-of-Darkness updated 4 weeks ago

harvard-hbs-d3/d3-open-webui #1

Open WebUI still hallucinating quotes

We updated: enabled OCR and changed Top k to 40. We used the "Generative AI and the Nature of Work" paper and it still hallucinated 3 quotes. This ticket is to have a conversation between D3 and AM. …

ndbolligerD3 updated 6 days ago

Future-House/paper-qa #12

Any thoughts on OCR for older papers? (image-only)

EDIT: [a related OCR/NLP avenue](https://doi.org/10.48550/arXiv.2302.14045)

sgbaird updated 2 months ago

kermitt2/grobid #507

Is Grobid able to OCR papers?

Does Grobid do OCR? I am trying to get Grobid to process older PDFs like: https://www.cs.cmu.edu/~crary/819-f09/Reynolds74.pdf https://chomsky.info/wp-content/uploads/195609-.pdf http://somr.…

AaronNGray updated 11 months ago

bosun-ai/swiftide #356

integrations: PDF segmentation and ingest via Aryn model / S…

I'm fired up about a Rust-implemented document parsing / embedding engine for my code and documents. Sadly, I don't see good PDF ingestion in the code. Ideally, I'd like to import PDFs from acad…

jac-cbi updated 3 weeks ago

breezedeus/Pix2Text #142

Mathematical proofs in ArXiv papers recognized by OCR have a…

## Error screenshot ## Code snipp ```python from pix2text import Pix2Text img_fp = 'paper/2402.11867v3.pdf' p2t = Pix2Text.from_config() doc = p2t.recognize_pdf(img_fp) doc.to_mar…

bwnjnOEI updated 2 months ago

zeitlings/alfred-workflows #20

OCR workflow not working for specific documents

It's stuck at the embedding stage. I've tried a solution to #7 but it did not help. The output file such as `original_filename (ocr).pdf` is corrupted and can't be opened. Please see the debug below.…

snam24 updated 2 weeks ago

keras-team/keras-cv #924

Add MAXIM model

**Short Description** **MAXIM: Multi-Axis MLP for Image Processing**: I think it's a follow-up to MaxViT from Google. It shows great performance on the following low-level vision tasks, i.e. for OC…

innat updated 1 month ago

yangxue0827/R2CNN_FPN_Tensorflow #15

No module named rotate_polygon_nms

Hi, I ran the code but an error occurred. When I run inference.py, there is no error and I get the correct result, but when I run inerence1.py, there are some errors; the messa…

UpCoder updated 6 years ago

Excidos/ComfyUI-Documents #1

Working well with OCR

Thank you for the excellent node. I created a workflow about it on [OpenArt](https://openart.ai/workflows/fish_intent_33/pdftoslides-in-comfyui/7rIk4LKwjsKx8xzTyiEU). It works well with OCR. Howeve…

dseditor updated 4 months ago

425 results for ocr-papers
