Open neelsmith opened 1 year ago
Maybe just wrap this in a try
and don't sweat it if it fails? What are consequences of not stripping punct. out?
For 0.7 release, wrapping in a try
and warning. You can always preprocess the CitableTextCorpus
rather than relying on TextAnalysis
to remove punctuation.
See also #49 and #50 in preparing for a release supporting reproducible topic modeling
Generates deadly error on Greek text with multi-byte encoding, due to blind use of byte indexing rather than Unicode-aware processing?