Open nisbet-hubbard opened 7 months ago
So improved training is necessary. Do you know a freely available computer font which emulates that design? Or is there a ground truth data set which can be used to train recognition of that font?
Yes, there is! There’s two fonts with this sort of theta and rho, both under the open font licence.
GFS Heraklit: the text in the image probably used the italic of this. Scroll down for the download.
GFS Artemisia: in a slightly different style.
OCR result: ϑεοὶ γὰρ οὔποτ᾽,
This is an ordinary book font used by editions of classical texts. Because the design of its theta, however, this letter is frequently OCR’ed as a swash form and requires manual correction as it stands out from the rest of the text when rendered in other (esp. sans) Greek fonts.