weka511 / nlp

My experiments with Natural Language Processing. I've created a few programs to try out concepts.
GNU General Public License v3.0
1 stars 0 forks source link

Preprocess data #19

Closed weka511 closed 1 year ago

weka511 commented 1 year ago

Get rid of errors such as _'charmap' codec can't encode character '\u200a' in position 5: character maps to

_ and punctuation marks
weka511 commented 1 year ago

I haven't seen a recurrence since previous commit