celiafi / pipeline-mod-celia

Celia specific modules for the DAISY Pipeline 2
0 stars 0 forks source link

Symbols missing from liblouis table #28

Open jukkae opened 8 years ago

jukkae commented 8 years ago

Missing symbol: ” ” right double quotation mark. Quite probably left one, too, as well as possibly some other quotation marks.

jukkae commented 8 years ago

Table fixes depend a bit on Celia's future strategy, as well as the evolution of the system as a whole. It can be reasonably expected that new UTF-8 symbols will continue appearing for quite a long time. The question is, then, a) should every new symbol be added to the tables whenever they appear, b) should unknown symbols be replaced during procurement (either the subcontractor, or in Celia with e.g. Oxygen or whatever), or c) would it be possible to add some sort of interactive step to the system: "Encountered these symbols unknown to Liblouis, do you want to replace them with another symbol?"

bertfrees commented 8 years ago

This kind of interactive step could be build into pre-processing. SBS does something similar but with whole words instead of single characters. Verified words are added to a dictionary before transcription of the book happens. You could also build something in the system itself, but it wouldn't be interactive. You would show a warning or an error

jukkae commented 8 years ago

Split the issue into symbols missing from liblouis and elements missing from css (see https://github.com/snaekobbi/pipeline-mod-celia/issues/44).