segmentation-based-text-recognition Search Results

abhisheks008/DL-Simplified #681

Separating text from image

### Deep Learning Simplified Repository (Proposing new issue) :red_circle: Separating text from image :red_circle: Aim of the project is to provide users with code that can help them take out t…

harshmishra19 updated 1 month ago

tesseract-ocr/tesseract #4281

How to use Tesseract in a multi-threaded environment?

### Current Behavior `#include #include #include #include #include #include #pragma comment(lib, "tesseract54.lib") std::mutex io_mutex; void performOCR(const std::string& imagePath…

kinghelong updated 2 days ago

RedHenLab/multi-modal-emotion-prediction #1

Many thanks for the contribution, although the utterance segmentation is not a part of your work (the IEMOCAP emotion dataset is already segmented into utterances), do you have any idea about any too…

amirim updated 6 months ago

neulab/explainaboard_web #558

Missing example custom dataset format or system outputs

We are missing documentation for examples in the following tasks + file types. (Based on the file types that we do accept but are missing examples.) - named-entity-recognition: system output - js…

noelchen90 updated 1 year ago

nnop/notes #14

deep learning papers

# speech recognition - Soltau, Hagen, Hank Liao, and Hasim Sak. "Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition." arXiv preprint arXiv:1610.09975 (201…

nnop updated 6 years ago

OCR-D/spec #77

OCR on line vs word level

The current [specification](https://ocr-d.github.io/glossary#OCR) is agnostic about which **level of segmentation** OCR is supposed to operate on, either `TextLine` layout input (for `TextLine`, `Word…

bertsky updated 5 years ago

GasimV/Commercial_Projects #2

Speech Processing Models

`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…

GasimV updated 1 week ago

OCR-D/zenhub #17

Importing Transkribus PAGE-XML

# Current situation Users cannot readily use the PAGE-XML results of Transkribus in an OCR-D environment, because Transkribus' flavor of PAGE-XML is based on the older 2013 variant and contains pro…

krvoigt updated 2 years ago

emecas/commitit #175

10 Best Java NLP Libraries & Tools

https://www.bairesdev.com/blog/java-nlp-libraries-tools/

emecas updated 1 month ago

junxnone/aiwiki #192

ML Tasks Image OCR TextScanner

## Reference - [paper - 2019 - TextScanner: Reading Characters in Order for Robust Scene Text Recognition ](https://arxiv.org/pdf/1912.12422.pdf) - [旷视研究院提出TextScanner：确保字符阅读顺序，实现文字识别新突破](https://z…

junxnone updated 1 year ago

489 results for segmentation-based-text-recognition
