OCR-D / gt-guidelines

OCR-D guidelines for Ground Truth production
https://ocr-d.de/en/gt-guidelines/trans/
Creative Commons Attribution Share Alike 4.0 International
6 stars 5 forks source link

Why all the content on Google #4

Closed kba closed 5 years ago

kba commented 5 years ago

E.g. https://lh4.googleusercontent.com/1HdqrPTGjE0nHay-YhnX16ipvQzmWW44oiWjsL4x9lh-aRq_J-flmV1oNOgflJFu5T-F9OEvKeQW8H1i_7gR7EMcF36wq1E8ktKO2fBWqLw2NDylG81YNE-Yt6DK8P599sZXajvP and many more.

tboenig commented 5 years ago

There is a workflow error in the creation of the DITA files. The images for the PAGE documentation are stored in the img directory and the images for the documentation, for example for the description of the transcription, are stored in the images directory.

kba commented 5 years ago

OK, can we clean this up? Maybe a hierarchy like

/
/images
/images/pagexml   <- generated images
/images/figures   <- snippets of glyphs and layout specimen etc.
/images/layout    <- Stuff like arrows, plus/minus buttons, borders etc. used for UI
/dita/pagexml       <- generated DITA about PAGE XML (?)
/dita/gt-guidelines <- The actual prose
/dita/gt-guidelines/en
/dita/gt-guidelines/de
/build              <- output and temp files, to be .gitignored
kba commented 5 years ago

obsolete