slub / dfg-viewer

The DFG Viewer is a free web service for browsing digitized books from remote library repositories in a rich and dynamic environment.
https://dfg-viewer.de
GNU General Public License v3.0

Missing whitespace in ALTO output #147

Closed sebastian-meyer closed 4 years ago

sebastian-meyer commented 4 years ago

The DFG Viewer relies on <SP> elements being present in the provided ALTO fulltext. Without them, the fulltext is displayed without any whitespace between words. Since <SP> is optional according to the ALTO XSD, the DFG Viewer should not require it either.

See https://github.com/UB-Mannheim/ocr-fileformat/issues/78 (especially https://github.com/UB-Mannheim/ocr-fileformat/issues/78#issuecomment-441122514)
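To illustrate the expected behaviour, here is a minimal Python sketch of how a consumer could rebuild the text of one ALTO <TextLine> whether or not <SP> elements are present. It is not the DFG Viewer's actual code (nor the fix in #148); the helper names `local_name` and `line_text` are hypothetical, and the sketch assumes that adjacent <String> elements without an <SP> between them should be separated by a single space. Other children such as <HYP> are ignored for brevity.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the XML namespace, so any ALTO version's namespace is accepted."""
    return tag.rsplit("}", 1)[-1]


def line_text(text_line):
    """Join the words of one ALTO TextLine element into a plain string.

    Explicit <SP/> elements become a space. If two <String> elements
    follow each other directly, a space is inserted as a fallback
    (assumption: missing <SP> means "one ordinary space").
    """
    parts = []
    prev_was_string = False
    for child in text_line:
        name = local_name(child.tag)
        if name == "String":
            if prev_was_string:
                # No <SP> between two words: fall back to a single space.
                parts.append(" ")
            parts.append(child.get("CONTENT", ""))
            prev_was_string = True
        elif name == "SP":
            parts.append(" ")
            prev_was_string = False
        # Other children (e.g. <HYP>) are skipped in this sketch.
    return "".join(parts)


ALTO_NS = "http://www.loc.gov/standards/alto/ns-v3#"

with_sp = ET.fromstring(
    f'<TextLine xmlns="{ALTO_NS}">'
    '<String CONTENT="Hello"/><SP/><String CONTENT="world"/>'
    "</TextLine>"
)
without_sp = ET.fromstring(
    f'<TextLine xmlns="{ALTO_NS}">'
    '<String CONTENT="Hello"/><String CONTENT="world"/>'
    "</TextLine>"
)

# Both variants yield the same text.
print(line_text(with_sp))
print(line_text(without_sp))
```

With this fallback, ALTO files that omit <SP> produce the same readable line as files that include it, and files that do include <SP> do not get doubled spaces.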

sebastian-meyer commented 4 years ago

Fixed with #148