-
# Description
Some use cases need to get access to information stored in the OCR format:
- OCR correction scenario
- access to word confidence (see Issue #68)
- access to other kind of informati…
-
I have a PDF that is composed only of photos of text: can pdf-text-extraction pull text from that?
Specifically, it's a 5-page PDF that is purely photos of pages of a book. It's intentionally suppo…
-
1, 下载deep_ocr_workspace.zip
2,docker pull jinpengli/deep_ocr_cpu_docker:latest
3,docker run -ti --volume=${HOME}/deep_ocr_workspace:/workspace jinpengli/deep_ocr_cpu_docker:latest /bin/bash
4,pytho…
-
Hi
I tried the same page with same setup with both Kraken 5.x and Kraken 4.x with provided Arabic_best.ml and there is more errors in the latest version (5.x) I think this relate to changes in segmen…
bmwmy updated
5 months ago
-
Currently character segmentation uses the first-order derivative of a vertical histogram. The segmentation then slices the character areas ignoring the first 2 pixels. This is not ideal since it is …
-
Due to the lack of a specification on that aspect, our processors have no or no uniform way to inform the user whether or not a GPU device is used (or even parameterise which one to prefer). Here's th…
-
@sunke123 @wondervictor @welleast @leoxiaobin @bearcatt Thanks for open sourcing the source code , its a great work . I have few queries
Q1 when i used the HRNet + OCR model on BDD100k and on cust…
-
hello ,@sergiomsilva, could you release the OCR training code now? I follow your WPOD-NET, and get good results in our datasets, but this is only LP detection. And OCR result is not well. So hope you…
-
```
Examples:
* To segment existing regions into lines (and only lines) only:
`segmentation_level="line"`, `textequiv_level="line"`, `model=""`
* To segment existing regions into lines (and o…
-