Closed keab42 closed 7 months ago
Ah. I thought I'd checked all the closed issues thoroughly, but this is a duplicate of https://github.com/jsvine/markovify/issues/84
Should be easy enough to adjust my input texts to exclude ones with only one sentence.
I'm experimenting with importing a variety of different texts for a pet project.
I've been working on consuming and sanitising some of the text, but with NTLK tagging activated, I've run into what appears to be an issue parsing text that contains brackets, square brackets, double and single quotes.
I can easily remove these from the text, but it would be nice to be able to preserve this punctuation if possible.
I did try something like str.replace("[", "["), but that did not seem to help.
For example:
Input string: "[Babbles back] Sixty seconds."
Error stack trace:
My POSifiedText class looks like this: