In the chapter on preprocessing, NLTK's PunktWordTokenizer is used directly (input 11). This no longer seems to work in NLTK version 3.0.3. In fact, this word tokenizer was not supposed to be used in the first place. Maybe it should be removed from the tutorial?
In the chapter on preprocessing, NLTK's PunktWordTokenizer is used directly (input 11). This no longer seems to work in NLTK version 3.0.3. In fact, this word tokenizer was not supposed to be used in the first place. Maybe it should be removed from the tutorial?