ariddell / tatom

Quantitative Text Analysis for the digitale Geisteswissenschaften
https://de.dariah.eu/tatom/
47 stars 17 forks source link

Using PunktWordTokenizer #14

Open bacor opened 9 years ago

bacor commented 9 years ago

In the chapter on preprocessing, NLTK's PunktWordTokenizer is used directly (input 11). This no longer seems to work in NLTK version 3.0.3. In fact, this word tokenizer was not supposed to be used in the first place. Maybe it should be removed from the tutorial?