-
**Describe the bug**
I am evaluating the UnstructuredClient for processing PDF documents and am encountering an issue with the Greek language text extraction. When I attempt to extract text from PDF …
-
I would like to contribute to the project by extracting data from **Arxiv**.
I would like to extract **titles** and **abstracts** or other metadata that might be helpul.
I think extracting the …
-
## Type
- [ ] General question or discussion
- [X] Propose a brand new feature
- [ ] Request modification of existing behavior or design
## What is the problem that your feature request …
-
## Reference
- [CTC算法详解](https://www.jianshu.com/p/0cca89f64987)
- [paper - 2014 - First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs](https://arxiv.org/…
-
Hi, thanks for your great project!
I am wondering how many training dataset instances you are used, such as COCO, OCR-VQA and A-OKVQA, did you just transform the original dataset with the template s…
-
A small collection of sample pictures to test the quality of the OCR algorithm would be helpful.
I.e. there should be about 15 pictures with different lighting conditions and different text size and …
-
Heya. I use mostly linux, but oddly enough I have had great results via tesseract on windows if I remember correctly.
I have some old documents (semi-old, paper print out only, office bills and suc…
-
**Describe**
I am using LayoutLM, would you please provide the script to prepare the OCR output (html format) for the RVL-CDIP? The readme mentions about Tesseract, however it will be much conventien…
-
I found, that some ECS papers has gif pictures for formulas and numbers.
For example: http://jes.ecsdl.org/content/157/3/J69.full
span class="inline-formula" id="inline-formula-38">
-
nice work
First a question:
could you provide your model -> model66.zip. I do not have a decent GPU :-( otherwise how much time it would take on google cloud with a GPU (T4, K80, P100, V100, P4)?…