saulalbert / unixclan

Utility scripts for TalkBank's CLAN

CAlite2CHAT should lowercase all turn-initial TCUs (in turn-beginnings and turn-incoming overlaps) #43

Closed saulalbert closed 6 years ago

saulalbert commented 6 years ago

Lowercase the first character of any turn-initial utterance, i.e. the first character on any line that follows a speaker ID and a colon/tab combination.

e.g.:

*PS002: You enjoyed yourself in America?

becomes

*PS002: you enjoyed yourself in America?

Also, where there is a turn-incoming overlap, the turn-incoming (i.e. second line) overlap should have its first character lowercased as well, even when that character sits inside the overlap brackets.

e.g.:

*PS002: Yes oh Jim 's in Flint this afternoon at the Hart and <Straw> [>]
        Club .
*PS006: <Hmm:> [<]. 

becomes

*PS002: yes oh Jim 's in Flint this afternoon at the Hart and <Straw> [>]
        Club .
*PS006: <hmm:> [<].
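
A minimal sketch of the rule described above, in Python rather than whatever CAlite2CHAT itself uses. The function name `lowercase_turn_initial` and the regular expression are hypothetical, not taken from the repository. It assumes a main-tier line starts with `*`, an alphanumeric speaker ID, a colon, and spaces or tabs, and that the only marker that can precede the first letter is an opening `<`:

```python
import re

# Assumed line shape: "*" + speaker ID + ":" + spaces/tabs, then an optional
# opening "<" (overlap bracket), then the first letter of the utterance.
TURN_START = re.compile(r"^(\*[A-Za-z0-9]+:[ \t]+<?)([A-Za-z])")


def lowercase_turn_initial(line: str) -> str:
    """Lowercase the first letter after the speaker ID, if the line has one.

    Continuation lines (no leading "*SPEAKER:") are returned unchanged.
    """
    return TURN_START.sub(lambda m: m.group(1) + m.group(2).lower(), line, count=1)


if __name__ == "__main__":
    sample = [
        "*PS002: Yes oh Jim 's in Flint this afternoon at the Hart and <Straw> [>]",
        "        Club .",
        "*PS006: <Hmm:> [<].",
    ]
    for line in sample:
        print(lowercase_turn_initial(line))
```

Only the first letter of the turn is touched, so "Jim", "Flint" and "Club" in the example keep their capitals. The sketch makes no exception for a turn that begins with a proper noun or with "I"; whether those should be preserved is left open here.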