Closed by tevzselcan 1 year ago
... which indicates you ignored previous errors (=> list.train was not created). IMO the Makefile should stop as soon as a previous step fails...
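A minimal sketch of why a run can carry on after an early error. It assumes the failing commands are chained with `;` inside a single shell invocation; this thread does not confirm that this is how the Makefile recipe is written. The variable names `lenient` and `strict` are made up for the illustration.

```shell
# Without `set -e`, a failing command in a ';' chain does not stop the shell:
# `false` fails, but the echo after it still runs.
lenient=$(sh -c 'false; echo "still running"')

# With `set -e`, the shell exits at the first failing command,
# so the echo is never reached and nothing is printed.
strict=$(sh -c 'set -e; false; echo "still running"' || true)

echo "lenient: '$lenient'"
echo "strict:  '$strict'"
```

In the first case the only visible symptom is whatever breaks later, which is why the last error in a log is often not the one that matters.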
OK, so I redid everything and got this when running make training once: https://pastebin.com/0uZGxPEB but I still get the error at the end:
python3 shuffle.py 0 "data/foo/all-lstmf"
/bin/bash: line 1: bc: command not found
/bin/bash: line 4: bc: command not found
+ head -n '' data/foo/all-lstmf
head: invalid number of lines: ''
+ tail -n '' data/foo/all-lstmf
tail: invalid number of lines: ''
make: *** [Makefile:191: data/foo/list.train] Error 1
Please install bc (basic calculator). I will put it into Readme.md.
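A small pre-flight check can surface a missing tool before training starts. This is only a sketch: `require_cmds` is a hypothetical helper, not something shipped with tesstrain, and the list of tools to check is an assumption based on the log above.

```shell
# Hypothetical helper, not part of tesstrain.
# Returns 0 if every named command is found on PATH; otherwise prints
# each missing command to stderr and returns 1.
# The body runs in a subshell ( ... ) so its variables do not leak.
require_cmds() (
    missing=0
    for cmd in "$@"; do
        if ! command -v "$cmd" >/dev/null 2>&1; then
            echo "missing command: $cmd" >&2
            missing=1
        fi
    done
    return "$missing"
)

# Example usage (commented out so the snippet has no side effects):
# require_cmds bc python3 && make training
```

On Ubuntu, bc can be installed with `sudo apt install bc`.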
Thank you! That worked. Just two quick questions, though. Would it be possible to train Tesseract to recognize symbols such as Ω and α, which appear quite often in physics? And could Tesseract be trained using only letters, so that the transcriptions would be like A a B b C d...?
See the part "adding the plus-minus sign (±) to the existing English model". Even though it is not mentioned for Tesseract 5 training, the process described there should work.
I tried to train with the sample data provided here (https://github.com/tesseract-ocr/tesstrain/blob/main/ocrd-testset.zip). I extracted the contents to data/foo-ground-truth and ran make training but got this error:
I'm running this on Ubuntu 20.04.