-
## DESCRIPTION
OCR will be a new type of project that will be released in the future. In this series of issues we are going to put together all the relevant parts needed to create a project and consume it properly.
…
-
### Title of the resource
OCR with Google Vision API and Tesseract
### Resource type
External Resource
### Authors, editors and contributors
Isabelle Gribomont, Liz Fischer, Ryan Cordell, Clemens…
-
[Questioning_development_review.PDF](https://github.com/run-llama/llama_parse/files/14429185/Questioning_development_review.PDF)
I wasn't intentionally testing OCR, but here we are. I won't share a…
-
It may be worth trying some alternative OCR libraries, as discussed in this article: https://www.statcan.gc.ca/en/data-science/network/character-recognition
Might be a good idea to have these alter…
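
As a rough illustration of keeping OCR libraries interchangeable, here is a minimal sketch that tries several backends in order and keeps the first non-empty result. Everything in it is an assumption for illustration: `ocr_with_fallback`, the `OcrBackend` type, and the three stand-in engines are hypothetical names, not part of any library mentioned in the article or in this project.

```python
from typing import Callable, Iterable, Optional, Tuple

# Assumption: an OCR backend is any callable taking an image path and returning text.
OcrBackend = Callable[[str], str]


def ocr_with_fallback(
    image_path: str,
    backends: Iterable[Tuple[str, OcrBackend]],
) -> Tuple[Optional[str], str]:
    """Try each backend in order; return (backend_name, text) for the first
    one that yields non-empty text, or (None, "") if none does."""
    for name, backend in backends:
        try:
            text = backend(image_path).strip()
        except Exception:
            # A failing backend should not stop the others from being tried.
            continue
        if text:
            return name, text
    return None, ""


# Hypothetical stand-ins for real engines, used only to show the call shape.
def broken_engine(path: str) -> str:
    raise RuntimeError("engine not installed")


def empty_engine(path: str) -> str:
    return "   "


def working_engine(path: str) -> str:
    return "ABC 123\n"


result = ocr_with_fallback(
    "scan.png",
    [("broken", broken_engine), ("empty", empty_engine), ("working", working_engine)],
)
print(result)
```

In a real setup each stand-in would wrap an actual OCR library call; the ordering of the list then decides which engine is preferred.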
-
To repro, create a custom pipeline config with only one step: ocr-pdf. Then try to OCR a PDF. You get an empty download.txt.
My config looks like this:
![image](https://github.com/Stirling-Tools…
-
Hi, I want to train trocr-small-printed for license plate OCR for my school work. However, when I use the TrOCR model from Hugging Face, the decoded outputs are garbage English words and not meaningful for…
-
Integrate a feature into the "AI Chat Assistant" tool that allows users to extract text data from image files using Optical Character Recognition (OCR) technology. This will enable the chatbot to proc…
-
**Describe the bug**
A strange one.
`IndexError: list index out of range` when OCR'ing a portion of a pdf doc, but depending on the split size, it doesn't always happen. My guess is that the firs…
-
Hello guys. Thank you so much for this brilliant model.
I'm aware that Donut is an OCR-free model which does not rely on an OCR input. When I performed some tests (fine-tuning the model), I realized…