dataspelunking / MLwR

Machine Learning with R
Unicode / Mojibake errors in CSV #2

Closed by edent 6 years ago

edent commented 6 years ago

For example on https://github.com/dataspelunking/MLwR/blob/master/Machine%20Learning%20with%20R%20(2nd%20Ed.)/Chapter%2004/sms_spam.csv#L79

Allo! We have braved the buses and taken on the trains and triumphed. I mean we€˜re in b€˜ham. Have a jolly good rest of week

Should be

Allo! We have braved the buses and taken on the trains and triumphed. I mean we're in b'ham. Have a jolly good rest of week

There are a few issues like this. Would you accept a PR to fix them?

dataspelunking commented 6 years ago

Absolutely! I've tried hunting for all the Unicode / ASCII issues but I never seem to catch them all. If you have a fix, I'd happily accept a PR. Thanks!
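
One way to hunt for the remaining issues is to list every line that contains non-ASCII characters and then apply a small table of known replacements. The following is a minimal sketch, not the fix that was actually submitted. The function names `find_non_ascii` and `repair` are hypothetical, and the `REPLACEMENTS` table is an assumption: it covers the `€˜` sequence quoted above plus the usual three-character forms produced when UTF-8 curly quotes are read as Windows-1252. Other garbled sequences in the file would need their own entries.

```python
import re

# Assumed mapping from mojibake sequences to the intended character.
# Only the two-character sequence is taken from the example in this issue;
# the three-character forms are the usual result of reading UTF-8 curly
# quotes as Windows-1252 and are included as a guess.
REPLACEMENTS = {
    "\u00e2\u20ac\u02dc": "'",  # mis-decoded left single quote
    "\u00e2\u20ac\u2122": "'",  # mis-decoded right single quote
    "\u20ac\u02dc": "'",        # the sequence seen in this issue
}

# Matches one or more consecutive characters outside the ASCII range.
NON_ASCII = re.compile(r"[^\x00-\x7f]+")


def find_non_ascii(lines):
    """Return (line_number, runs) for every line containing non-ASCII runs."""
    hits = []
    for number, line in enumerate(lines, start=1):
        runs = NON_ASCII.findall(line)
        if runs:
            hits.append((number, runs))
    return hits


def repair(line):
    """Apply the known replacements, longest sequence first."""
    for bad in sorted(REPLACEMENTS, key=len, reverse=True):
        line = line.replace(bad, REPLACEMENTS[bad])
    return line


if __name__ == "__main__":
    sample = [
        "I mean we\u20ac\u02dcre in b\u20ac\u02dcham.",
        "Have a jolly good rest of week",
    ]
    # Report suspicious lines, then show the repaired text.
    for number, runs in find_non_ascii(sample):
        print(number, runs)
    for line in sample:
        print(repair(line))
```

Running `find_non_ascii` again on the repaired lines shows whatever the table missed, which can then be reviewed by hand. Legitimate non-ASCII text in the messages would also be reported, so the output is a list of candidates rather than a list of errors.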