-
### Deep Learning Simplified Repository (Proposing new issue)
:red_circle: Separating text from image :
:red_circle:Aim of the project is to provide users with a code that can help them take out t…
-
I'm extensively using the `align` API in my product ([Enjoy App](https://github.com/ZuodaoTech/everyone-can-use-english), a language learning tool).
Here is the standard procedure:
1. The user u…
-
standard segmodel creates bad polygons on [standard layout and quality image](https://digi.vatlib.it/iiifimage/MSS_Reg.lat.10/Reg.lat.10_0023_fa_0010r.jp2/full/full/0/default.jpg) and leads to bad rec…
-
[The format of the issue]
Paper name/title:
Project link:
Paper link:
Code link:
amusi updated
10 hours ago
-
### Current Behavior
```
#include
#include
#include
#include
#include
#include
#pragma comment(lib, "tesseract54.lib")
std::mutex io_mutex;
void performOCR(const std::string& ima…
-
Many thanks for the contribution,
although the utterance segmentation is not a part of your work (the IEMOCAP emotion dataset is already segmented into utterances), do you have any idea about any too…
-
We are missing documentation for examples in the following tasks + file types.
(Based on the file types that we do accept but are missing examples.)
- named-entity-recognition: system output - js…
-
# speech recognition
- Soltau, Hagen, Hank Liao, and Hasim Sak. "Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition." arXiv preprint arXiv:1610.09975 (201…
-
The current [specification](https://ocr-d.github.io/glossary#OCR) is agnostic about which **level of segmentation** OCR is supposed to operate on, either `TextLine` layout input (for `TextLine`, `Word…
-
# Current situation
Users cannot readily use the PAGE-XML results of Transkribus in an OCR-D environment, because Transkribus' flavor of PAGE-XML is based on the older 2013 variant and contains pro…