cldf / segments

Unicode Standard tokenization routines and orthography profile segmentation
Apache License 2.0
31 stars 13 forks source link

Using plain "?" as exception marker conflicts with e.g. X-SAMPA #32

Closed xrotwang closed 6 years ago

xrotwang commented 6 years ago

Since ? is used as regular glyph in X-SAMPA, it shouldn't be used by default as exception marker.

tresoldi commented 6 years ago

Right. Luckily, I can't think of a common system which uses # (even though there probably is one). I can take care of this after the current bug fix.

xrotwang commented 6 years ago

I started to use U+FFFD REPLACEMENT MARKER for this purpose instead of ?.

tresoldi commented 6 years ago

Great! Was just about to ask if we shouldn't move to a specific Unicode charEm 16 de abr de 2018 5:03 PM, Robert Forkel notifications@github.com escreveu:I started to use � U+FFFD REPLACEMENT MARKER for this purpose instead of ?.

—You are receiving this because you were assigned.Reply to this email directly, view it on GitHub, or mute the thread.