How to train Traditional Chinese vertical text in Tesseract 5?

tesseract-ocr / tesstrain

Train Tesseract LSTM with make

How to train Traditional Chinese vertical text in Tesseract 5? #389

Open fishfree opened 1 month ago

fishfree commented 1 month ago

This is the screenshot from jTessBoxEditor: [screenshot] The example training files provided in this repo seem to be built as whole-line image and text pairs, rather than character by character. So my questions are:

How can I efficiently split a single image into multiple vertical lines of text?
In which direction should the vertical text be written in the text files, L2R or R2L?

fishfree commented 1 month ago

For Q1, I noticed the answer here.
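
For readers without the linked answer, one common way to approach Q1 is a vertical projection profile: count the ink in every pixel column and cut wherever a run of blank columns appears. The sketch below is only an illustration and is not necessarily the method from the link. The function name `split_columns`, the `min_gap` parameter, and the 0/1 list-of-rows image format are all assumptions made for this example. The right-to-left ordering reflects the usual reading order of traditional vertical text, not a confirmed tesstrain requirement.

```python
def split_columns(image, min_gap=1):
    """Return (start, end) x-ranges of text columns, rightmost first.

    `image` is a list of rows; each pixel is 1 for ink and 0 for background
    (an assumed format for this sketch). `end` is exclusive. Gaps narrower
    than `min_gap` blank pixel columns do not split a text column.
    """
    if not image:
        return []
    width = len(image[0])
    # Vertical projection profile: amount of ink in each pixel column.
    profile = [sum(row[x] for row in image) for x in range(width)]

    columns = []
    start = None     # x where the current text column began
    blank_run = 0    # consecutive blank pixel columns seen so far
    for x, ink in enumerate(profile):
        if ink:
            if start is None:
                start = x
            blank_run = 0
        elif start is not None:
            blank_run += 1
            if blank_run >= min_gap:
                # The last inked pixel column was at x - blank_run.
                columns.append((start, x - blank_run + 1))
                start = None
                blank_run = 0
    if start is not None:
        # Text column runs to the edge, minus any short trailing gap.
        columns.append((start, width - blank_run))

    # Assumption: columns are read from the rightmost one to the left.
    columns.reverse()
    return columns


if __name__ == "__main__":
    page = [
        [1, 1, 0, 0, 1, 0, 1],
        [1, 0, 0, 0, 1, 0, 1],
    ]
    print(split_columns(page, min_gap=2))
```

Each returned range could then be cropped out of the page (for example with Pillow's `Image.crop`) and saved as its own line image next to a matching `.gt.txt` file. A real scan would need binarization and deskewing first, and `min_gap` would need tuning so that gaps inside a single character do not split a column.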