jalapic / engsoccerdata

English and European soccer results 1871-2022
759 stars 193 forks source link

Corrected some wrong utf-8 characters. #38

Closed tresoldi closed 7 years ago

tresoldi commented 7 years ago

There are some wrong characters in data-raw/leaguecup.csv: many hyphens for \x96, plus a couple of trailing \xa3 and \xa7 (left overs from a previous conversion?).

This diff is large to show, but just replaces the \x96 by hyphens and removes the other characters. It makes the file utf-8 compliant and can be loaded without tricks by some common libraries (in particular, pandas.read_csv()).

tresoldi commented 7 years ago

This also changes (second commit) the dates for Turkish matches from 2006 (reported as 2020).

jalapic commented 7 years ago

thank you!