ocr-paper Search Results

1000+ results
for ocr-paper

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Unstructured-IO/unstructured #2939

Text Extraction Issue: Greek Language PDFs Rendered with Inc…

**Describe the bug** I am evaluating the UnstructuredClient for processing PDF documents and am encountering an issue with the Greek language text extraction. When I attempt to extract text from PDF …

DarioBernardo updated 6 months ago
3
LAION-AI/Open-Assistant #2076

Arxiv: Research papers

I would like to contribute to the project by extracting data from **Arxiv**. I would like to extract **titles** and **abstracts** or other metadata that might be helpul. I think extracting the …

SkanderHellal updated 1 year ago
3
ArchiveBox/ArchiveBox #1012

Feature Request: OCR archived PDF files to extract titles an…

## Type - [ ] General question or discussion - [X] Propose a brand new feature - [ ] Request modification of existing behavior or design ## What is the problem that your feature request …

turian updated 2 weeks ago
10
junxnone/aiwiki #189

ML Tasks Image OCR CTC

## Reference - [CTC算法详解](https://www.jianshu.com/p/0cca89f64987) - [paper - 2014 - First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs](https://arxiv.org/…

junxnone updated 1 year ago
1
open-mmlab/Multimodal-GPT #14

how many training instances are used?

Hi, thanks for your great project! I am wondering how many training dataset instances you are used, such as COCO, OCR-VQA and A-OKVQA, did you just transform the original dataset with the template s…

TobiasLee updated 1 year ago
1
falktan/ovip #3

Test cases

A small collection of sample pictures to test the quality of the OCR algorithm would be helpful. I.e. there should be about 15 pictures with different lighting conditions and different text size and …

falktan updated 3 years ago
4
tesseract-ocr/tessdoc #78

[Documentation] Recommend one or two GUIs for people new to …

Heya. I use mostly linux, but oddly enough I have had great results via tesseract on windows if I remember correctly. I have some old documents (semi-old, paper print out only, office bills and suc…

rubyFeedback updated 2 years ago
1
microsoft/unilm #216

Provide a script to get OCR result for RVL-CDIP in layoutLM

**Describe** I am using LayoutLM, would you please provide the script to prepare the OCR output (html format) for the RVL-CDIP? The readme mentions about Tesseract, however it will be much conventien…

yaoliUoA updated 3 years ago
5
CederGroupHub/LimeSoup #7

Formulas in gif format

I found, that some ECS papers has gif pictures for formulas and numbers. For example: http://jes.ecsdl.org/content/157/3/J69.full span class="inline-formula" id="inline-formula-38">

OlgaGKononova updated 5 years ago
2
jainammm/TableNet #5

how to proceed?

nice work First a question: could you provide your model -> model66.zip. I do not have a decent GPU :-( otherwise how much time it would take on google cloud with a GPU (T4, K80, P100, V100, P4)?…

philipus updated 3 years ago
1

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for ocr-paper

1000+ results
for ocr-paper