tesseract-ocr/langdata
Source training data for Tesseract for lots of languages
Apache License 2.0
837 stars, 889 forks
issues
Tatar language data
#305
rsabirov
closed
1 month ago
2
Rename frk -> deu_latf (ISO 639-3, ISO 15924)
#304
stweil
closed
8 months ago
1
special characters missing from `nor` and `dan` `desired_characters`
#303
FTHuld
opened
10 months ago
0
Trouble with "separator lines" made of **** or ----- or =======
#301
callegar
opened
1 year ago
1
Update desired_characters
#300
KhanbalaRashidov
opened
1 year ago
0
Language pack request: Accented Belarusian
#299
tryzniak
closed
2 years ago
2
Add Wynn, Eth, and Ash to Middle English script so it can also be used for Old English (Latin)
#298
grantbarrett
opened
2 years ago
1
install language
#297
englianhu
closed
2 years ago
0
Language Request: Kurdish Sorani (Central Kurdish)
#296
makwanbarzan
opened
2 years ago
1
Failed to initialise tesseract engine: .net 6.0 [Tesseract 4.1.1 + Tesseract.Data.English 4.0.0]
#295
J35P1N
closed
2 years ago
2
Added Akkadian unicharset file
#157
wincentbalin
closed
3 years ago
1
I'm ssory
#156
bykovman
closed
3 years ago
0
Cannot show Persian numbers
#155
netwons
opened
3 years ago
0
ful
#154
tukulor
opened
3 years ago
8
Santali Language (Ol Chiki script) OCR
#153
Prasanta-Hembram
opened
4 years ago
0
Balinese Script OCR
#152
gindrawan
opened
4 years ago
26
Add Akkadian langdata
#150
wincentbalin
closed
3 years ago
15
About Uyghur Language recognition
#149
rustam
opened
5 years ago
0
Normalize unicode in texts
#148
stweil
closed
5 years ago
0
Can't encode transcription
#147
peterbence3
closed
5 years ago
3
Update description for repo - Suggested Text:
#146
Shreeshrii
opened
5 years ago
0
Romanian Cyrillic
#145
bvrabete
opened
5 years ago
4
Add Apache license file
#144
stweil
closed
5 years ago
0
Fix extra intra-word spacing in Chinese and Japanese (GitHub issue #991)
#143
stweil
closed
5 years ago
1
Fix Chinese and Japanese langdata config
#142
stweil
closed
5 years ago
1
[tha] Please add support for Thai Character "Phinthu"
#141
agguser
opened
5 years ago
0
Changes for Kurdish
#140
Shreeshrii
closed
5 years ago
4
what is the use of Traintext ? Shouldnt it be images instead?
#139
himanshk96
closed
5 years ago
1
Arabic Numbers
#138
AhmadAlhati
opened
5 years ago
1
Some characters missing in spa.training_text makes Tesseract fail recognizing them
#137
diegodlh
opened
5 years ago
2
Added special characters to swedish desired_characters file
#136
aslamy
closed
5 years ago
1
Missing many special characters in desired_characters file (Swedish)
#135
aslamy
opened
6 years ago
0
added 60k urdu words
#134
laamalif
opened
6 years ago
0
this is not an issue, i just need some guaidline for urdu dataset, any expert please?
#133
ghost
closed
6 years ago
0
Error when trying to make ScrollView.jar
#132
topherseance
closed
6 years ago
1
Add Indic numerals and missing punctuation to Arabic
#131
mustafa0x
opened
6 years ago
4
Geresh and Gershayim are not included
#130
yarons
opened
6 years ago
11
Maqqaf recognition
#129
yarons
closed
5 years ago
2
hin.wordlist
#128
mymonoo
closed
5 years ago
0
Do not merge: update based on tessdata_fast at 7274cfa
#127
Shreeshrii
closed
5 years ago
5
Add Javanese Script for jav-java
#126
Shreeshrii
opened
6 years ago
55
Please help me..
#125
abdulbadii
closed
5 years ago
1
Language request: Kurdish-Kurmanji
#124
brandones
closed
5 years ago
22
remove 'tessedit_load_sublangs chi_tra' for korean
#123
Shreeshrii
closed
6 years ago
0
[info] OCR Ground Truth Resources
#122
amitdo
opened
6 years ago
0
Remove parameter textord_tabfind_vertical_horizontal_mix
#121
stweil
closed
6 years ago
3
Fix file mode (remove execute permission)
#120
stweil
closed
6 years ago
0
Documentation of how to contribute
#119
avelino
opened
6 years ago
0
Portuguese (Brazil) source
#118
avelino
opened
6 years ago
0
copy files from kur subdirectory
#117
Shreeshrii
closed
6 years ago
0