saulalbert / unixclan

Utility scripts for TalkBank's CLAN
0 stars 0 forks source link

CHAT2CAlite converts glottal stop marks (ʔ) into normal question marks (?) #19

Closed saulalbert closed 6 years ago

saulalbert commented 6 years ago

CHECK doesn't allow mid-TCU question marks, so we'll use a different unicode character where a question mark would appear in the middle of a turn (ʔ)

(U+0294 : LATIN LETTER GLOTTAL STOP)

So we can convert:

a mid-TCU high-riseʔ where the turn continues ---> mid-TCU high-rise? where the turn continues

So

helloʔ how are you

becomes

hello? how are you

saulalbert commented 6 years ago

NB: I hope this isn't already in progress. I've changed it after re-checking my list and re-thinking a few things after ou meeting. I think instead of doubling up symbols, we can just use new unicode characters (which pass CHECK without us having to make any upstream changes or worrying about double symbols).