-
http://tintin.sourceforge.net/mtts
Seems like a good way to allow people to enable UTF-8 if they'd like to use it.
vadi2 updated
10 months ago
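For readers who want to see how a server might act on this, here is a minimal sketch of checking the UTF-8 flag in an MTTS terminal-type reply. It assumes the reply has the form `MTTS <bitvector>` and that UTF-8 is bit value 4, as I read the page linked above; the function name `client_supports_utf8` is hypothetical and not taken from any client or server codebase.

```python
# Assumption: bit value 4 marks UTF-8 support in the MTTS bitvector,
# per the MTTS description linked above. Verify against that page.
MTTS_UTF8 = 4


def client_supports_utf8(ttype_reply: str) -> bool:
    """Return True if a reply such as 'MTTS 141' has the UTF-8 bit set."""
    parts = ttype_reply.strip().split()
    if len(parts) != 2 or parts[0].upper() != "MTTS":
        # Not an MTTS reply (e.g. a plain terminal name); assume no UTF-8.
        return False
    try:
        bitvector = int(parts[1])
    except ValueError:
        return False
    return bool(bitvector & MTTS_UTF8)
```

A server could call this on the terminal-type reply and only switch its output encoding when it returns `True`, leaving the choice with the user's client settings.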
-
OCR text created by the derivation process can be exposed as annotations for books and image-based media, enabling presentation and consumption of the text by IIIF clients.
-
Some of the changes that encode UTF-8 strings as bytes (and then decode them) seem to have left sendXMLRPC behind. Here is the issue:
I believe the basic issue is that in the upgrade that handled…
mgage updated
9 months ago
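As an illustration of the encode-then-decode pattern described here, the following is a minimal sketch only. The helper names `to_wire` and `from_wire` are hypothetical; this is not the project's sendXMLRPC code, and it assumes the payload is UTF-8 text.

```python
def to_wire(text: str) -> bytes:
    """Encode a text string as UTF-8 bytes before sending it."""
    return text.encode("utf-8")


def from_wire(payload) -> str:
    """Decode UTF-8 bytes back to text; pass through values that are already str."""
    if isinstance(payload, bytes):
        return payload.decode("utf-8")
    return payload
```

A code path that skips one half of this pair ends up handing bytes to code that expects text, or the reverse, which is the kind of mismatch the report describes.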
-
### Font
NotoSansMath-Regular.otf
### Where the font came from, and when
Site: https://github.com/googlefonts/noto-fonts/blob/e60eedc24cf3fc7e47e6d9eb488820ed3aa04923/unhinted/otf/NotoSansMath/No…
-
What’s your stance on curly quotes, guillemets, en/em dashes and other non-ASCII punctuation? I see the ellipsis `…` on the top left key ↓ and degree sign `°` on bottom right ↗.
MessagEase puts fre…
-
Running `tesserocr-recognize` as a processing worker has some side effects. Notably, the logging error does not occur when running in a Docker environment.
1) Dump of the ocrd to…
-
It would appear that there is some real disagreement about text-transform and a gap of understanding between spec authors and implementation realities that I was not previously aware of (the gap, not …
-
Ran with the Docker image `siletypesetter/sile:v0.14.13`:
```
\begin[papersize=a6]{document}
\use[module=packages.lorem]
\use[module=packages.dropcaps]
\dropcap[join=true, lines=4]{J}\lorem[words=50]
\e…
-
### Environment
* **Tesseract Version**: tesseract alpha - 4.0.0
* **Platform**: Linux Ubuntu 16.04 LTS
Tesseract `lstmtraining` is being used to train the Korean language. The following error has occurre…