segment-ocr Search Results

mittagessen/kraken #615

Is there a way to simply call kraken recognizer from code?

I would like to call kraken recognition from code and get hocr. However, just calling `kraken --hocr -i image.png image.txt binarize segment ocr --reorder` will initialize model on each image recognit…

dantetemplar updated 3 months ago

tesseract-ocr/tesseract #1192

Noise characters recognized with bbox as the entire page

### Environment * **Tesseract Version**: v4.00.00dev-692-gad5ee18 with Leptonica * **Commit Number**: ad5ee18 * **Platform**: MAC OS 16.7.0 Darwin Kernel Version 16.7.0: Thu Jun 15 17:36:27 PDT 2…

TerryZH updated 3 years ago

gbhl/bhl-segment-definition #36

The University of Kansas science bulletin

swlny updated 4 months ago

OCR4all/LAREX #290

AlternativeImages are all moved to page level when saving

I just found that the new PAGE-XML editing facility still has one bug: it does not retain `AlternativeImage` under their original segment (which could be region, line, word or even glyph), but moves t…

bertsky updated 1 month ago

ogkalu2/comic-translate #91

New update is incredible amazing !!

Hey buddy, this new update is incredible. I was absolutely blown away. Being able to edit the translated text already solves most of my problems. I just have one small favor to ask. As you can see …

Sterben1579 updated 1 month ago

tesseract-ocr/tesseract #2738

Duplicate Characters in Output Stream

Please refer to the following link: https://github.com/tesseract-ocr/tesseract/pull/2635 This concerns changes made to lstm_choices_mode. Unless I misunderstand what these options are suppose…

woodjohndavid updated 6 months ago

HMS-MLKit/HUAWEI-HMS-MLKit-Sample #42

Cloud Text Recognition don't work

I get an "urlList is empty, failed to detect cloud text" error when I try to recognize the text from the Album right in your sample code. Please fix it

kobidy1102 updated 4 years ago

PaddlePaddle/PaddleNLP #6325

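[Question]: When predicting with the UIE-X model, the predicted box position is correct, but the characters in the box are incomplete, with some missing

### Please ask your question When an image is input and prediction results are output, the predicted box for a field can be correct while some of the characters contained in the box are missing. The step that determines which characters the predicted box contains uses logic that computes character width, and this width is currently the average width. In practice, when a line of text contains Chinese characters, digits, and letters together, each character has a different width, so using the average width to decide whether the current character's box falls inside the predicted box gives wrong results. In the code, the character width…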

liangxinxin updated 1 month ago

tesseract-ocr/tesseract #1902

text2image Null box at index 0

root@ubuntu:/home/administrator/tesseract-master# /usr/local/bin/text2image --text=/home/administrator/langdata/chi_sim/chi_sim.training_text --fontconfig_tmpdir=/tmp/font_tmp.HKWX4LOUh0 --fonts_dir=…

YiWenFY updated 1 year ago

dimdenGD/chrome-lens-ocr #8

TypeScript support

Currently you have to depend on README to know the APIs, by adding types you can help save time in development. It should be easy since everything is already documented though. Should I make a PR?

sglkc updated 2 weeks ago

1000+ results for segment-ocr
