MichaelFenwick / WomensScriptTransliterator

A project to transliterate text from English to the Women's Script featured in the Stormlight Archive novels.
0 stars 0 forks source link

Transliteration of Stormfather's dialog doesn't correctly work with `ch` digraph. #41

Open MichaelFenwick opened 1 year ago

MichaelFenwick commented 1 year ago

In the WoR chapter Striding the Storm (and probably other places as well), when the Stormfather speaks, the word child is transliterated with the ch being two separate characters rather than the single character it should be.

My initial guess is that this is caused by the C being a capital letter, which, in the character set used by the Stormfather, is causing issues. It might even be the case that there are formatting tags around that letter alone, separating it from the rest of the word.

MichaelFenwick commented 6 days ago

On examination, it appears that this is caused by the C and hild not being a singular word, but rather one which is split apart with a span. This is a case where atomic word transliteration is needed. In the ideal case, the C would be replaced with the women's script ch glyph, while the following span would be turned into simply ild.

The following is the HTML of the paragraph where this issues is evident.

<p class="TX">C<span class="ePub-SC-1">HILD OF</span> H<span class="ePub-SC-1">ONOR.</span> It spoke to him this time. Last time, in the middle of the storm, it had not—though it had done so in dreams.</p>