ebergelson opened this issue 4 years ago (status: open)
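This appears to be because of the default replace = TRUE. It still feels like a bug, but maybe it is an error in the database? Setting replace = FALSE returns only the expected glosses:
library(childesr)
dogtest <- get_tokens(token = "dog", replace = FALSE)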
unique(dogtest$gloss)
output: [1] "Dog" "dog"
Almost certainly a problem with the way we parse replacements. That is the most complicated part of parsing CHILDES, especially the interaction of replacement tokens with the annotations for disfluencies and reformulations. We'll look into it.
Simplest example:
dogtest <- get_tokens(token = "dog")
unique(dogtest$gloss)
output: [1] "Dog" "dog" "laughing" "dog's"
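To pin down which glosses only show up when replacement is on, the two results can be diffed. This is a minimal sketch that uses the outputs quoted in this thread as literal vectors so it runs offline. `unexpected_glosses` is a hypothetical helper, not part of childesr, and the commented-out lines assume the same `get_tokens()` calls used in this thread:

```r
# Hypothetical helper (not part of childesr): glosses that appear with
# replacement on but are absent with replacement off.
unexpected_glosses <- function(with_replace, without_replace) {
  setdiff(unique(with_replace), unique(without_replace))
}

# Values copied from the outputs reported in this thread.
gloss_default    <- c("Dog", "dog", "laughing", "dog's")  # replace = TRUE (default)
gloss_no_replace <- c("Dog", "dog")                        # replace = FALSE

unexpected <- unexpected_glosses(gloss_default, gloss_no_replace)
print(unexpected)

# Against the live database (needs childesr and a network connection):
# library(childesr)
# unexpected_glosses(get_tokens(token = "dog")$gloss,
#                    get_tokens(token = "dog", replace = FALSE)$gloss)
```

Subsetting the default result with something like `dogtest[dogtest$gloss %in% unexpected, ]` should then surface the rows worth inspecting for how the replacement was parsed.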