Closed akasaka99 closed 2 years ago
You already got the answer: brew install tesseract-lang
installs all available languages and scripts.
PS. I use MacOS.
Please use the Tesseract user forum for questions.
Sorry I don't understand. From the prompt it says English, OSD (???) and SNUM (???) ... im not sure what these are. I cant get Chinese for instance and on linux you have it, so definitely not all languages
Listing of successful installation of all languages (including Chinese):
% brew install tesseract-lang
Running `brew update --preinstall`...
==> Auto-updated Homebrew!
Updated 3 taps (homebrew/core, homebrew/cask and homebrew/services).
==> New Formulae
libxcvt mbt tradcpp
==> Updated Formulae
Updated 267 formulae.
==> New Casks
avifquicklook dcp-o-matic-disk-writer medis miniwol mx-power-gadget yattee
dcp-o-matic-combiner dcp-o-matic-playlist-editor miln-movie-splitter mp3tag tomatobar
==> Updated Casks
Updated 127 casks.
==> Downloading https://ghcr.io/v2/homebrew/core/tesseract/manifests/5.1.0
######################################################################## 100.0%
==> Downloading https://ghcr.io/v2/homebrew/core/tesseract/blobs/sha256:9ccac58d048015f4a413b77e101efb3dd887705013c8171cff4f1de291dd56d3
==> Downloading from https://pkg-containers.githubusercontent.com/ghcr1/blobs/sha256:9ccac58d048015f4a413b77e101efb3dd887705013c8171cff4f1de291dd56d3?se=2022-05-10T13%3A50%3A00Z&sig=Z8D27bPZHfE0RnsjcWjGndCMJTez2cYB
######################################################################## 100.0%
==> Downloading https://ghcr.io/v2/homebrew/core/tesseract-lang/manifests/4.1.0
######################################################################## 100.0%
==> Downloading https://ghcr.io/v2/homebrew/core/tesseract-lang/blobs/sha256:589b4e7851f76924cf8bd77155f53ffda95bb92cbb19327aed1766000a203760
==> Downloading from https://pkg-containers.githubusercontent.com/ghcr1/blobs/sha256:589b4e7851f76924cf8bd77155f53ffda95bb92cbb19327aed1766000a203760?se=2022-05-10T13%3A50%3A00Z&sig=UbBEQFNWHdETvH6Kd2bBWqZE%2BEW2p%
######################################################################## 100.0%
==> Installing dependencies for tesseract-lang: tesseract
==> Installing tesseract-lang dependency: tesseract
==> Pouring tesseract--5.1.0.arm64_monterey.bottle.tar.gz
🍺 /opt/homebrew/Cellar/tesseract/5.1.0: 58 files, 30.0MB
==> Installing tesseract-lang
==> Pouring tesseract-lang--4.1.0.all.bottle.tar.gz
🍺 /opt/homebrew/Cellar/tesseract-lang/4.1.0: 164 files, 653.9MB
==> Running `brew cleanup tesseract-lang`...
Disable this behaviour by setting HOMEBREW_NO_INSTALL_CLEANUP.
Hide these hints with HOMEBREW_NO_ENV_HINTS (see `man brew`).
stweil@notebook11 ocrd_detectron2 % tesseract --list-langs
List of available languages in "/opt/homebrew/share/tessdata/" (163):
afr
amh
ara
asm
aze
aze_cyrl
bel
ben
bod
bos
bre
bul
cat
ceb
ces
chi_sim
chi_sim_vert
chi_tra
chi_tra_vert
chr
cos
cym
dan
deu
div
dzo
ell
eng
enm
epo
equ
est
eus
fao
fas
fil
fin
fra
frk
frm
fry
gla
gle
glg
grc
guj
hat
heb
hin
hrv
hun
hye
iku
ind
isl
ita
ita_old
jav
jpn
jpn_vert
kan
kat
kat_old
kaz
khm
kir
kmr
kor
kor_vert
lao
lat
lav
lit
ltz
mal
mar
mkd
mlt
mon
mri
msa
mya
nep
nld
nor
oci
ori
osd
pan
pol
por
pus
que
ron
rus
san
script/Arabic
script/Armenian
script/Bengali
script/Canadian_Aboriginal
script/Cherokee
script/Cyrillic
script/Devanagari
script/Ethiopic
script/Fraktur
script/Georgian
script/Greek
script/Gujarati
script/Gurmukhi
script/HanS
script/HanS_vert
script/HanT
script/HanT_vert
script/Hangul
script/Hangul_vert
script/Hebrew
script/Japanese
script/Japanese_vert
script/Kannada
script/Khmer
script/Lao
script/Latin
script/Malayalam
script/Myanmar
script/Oriya
script/Sinhala
script/Syriac
script/Tamil
script/Telugu
script/Thaana
script/Thai
script/Tibetan
script/Vietnamese
sin
slk
slv
snd
snum
spa
spa_old
sqi
srp
srp_latn
sun
swa
swe
syr
tam
tat
tel
tgk
tha
tir
ton
tur
uig
ukr
urd
uzb
uzb_cyrl
vie
yid
yor
Hello, I understand you only have Windows and Linux but since MacOS is linux based and has repos on homebrew, I installed version 5..1.0 with homebrew with the following command
brew install tesseract-lang
and I got the messageThis formula contains only the "eng", "osd", and "snum" language data files. If you need any other supported languages, run brew install tesseract-lang
. Now eventhough it works fine for English, I cannot install all the other languages, would any know by any chance the command line to install all the languages as clearly the one I used does not install them all whereas it works perfectly for English. Many thanks!