w3c / elreq

Ethiopic Layout Requirements
Other
10 stars 4 forks source link

A switch for Ethiopic wordspace to space & vice-versa would be useful #117

Open r12a opened 4 years ago

r12a commented 4 years ago

There should be a way to automatically convert word-space characters to ordinary spaces, and vice versa.

The layout requirements document has an algorithm that can be used to convert between the two.

r12a commented 4 years ago

The first comment in this issue contains text that will automatically appear in the Ethiopic gap-analysis document as a subsection with the same title as this issue. Any edits made to that comment will be immediately available in the document. Proposals for changes or discussion of the content can be made in comments below this point.

fantasai commented 4 years ago

Looking it over, it seems like the algorithm is primarily doing fixup, which is good for editing software but maybe less appropriate for a CSS feature. But we could certainly add a text-transform for switching between ethiopic word space and regular spaces. Once you figure out what you want to do with nbsp, feel free to file an issue against CSS Text 4. :)

dyacob commented 4 years ago

I'll check with @andjc to see if he would be interested in taking on the transform, I can prepare some test cases.

dyacob commented 4 years ago

Attached is the introduction of መዝገበ፡ፊደል፡ by ኪዳነ፡ወልደ፡ክፍሌ፡ (Mezgebe Fidel by Kidane Wolde Kifle) originally published in 1934 (Gregorian) and republished in 1965. The work is historically significant and politically neutral. It should be good for testing this issue, #116 , #122 , and #127. Unfortunately the end of the 2nd paragraph is odd, it appears like some text is missing though it properly reflects the printed version. While odd, it would present a special corner case for wordspace substitution. I'll add some additional samples and will keep a look out for more corner cases.

MezgebeFidel-Introduction-KidaneWoldeKifle-With-Wordspace.txt MezgebeFidel-Introduction-KidaneWoldeKifle-With-Space.txt