dgrtwo / tidy-text-mining

Manuscript of the book "Tidy Text Mining with R" by Julia Silge and David Robinson
http://tidytextmining.com
Other
1.31k stars 803 forks source link

Version of Pride & Prejudice from Project Gutenberg has "Chapter" issues #85

Open juliasilge opened 3 years ago

juliasilge commented 3 years ago

The version of Pride and Prejudice that gets pulled down from Project Gutenberg now has:

Right now, Sense and Sensibility works correctly, as an alternative.

juliasilge commented 2 years ago

There are also problems with War of the Worlds.

Consider adding a bit more explanation here about finding "Chapter" and looking at data in general.