joemgon / worldcup2018

Awesome Stats
1 stars 1 forks source link

adding files for fuzzy matching between squad data and fifa data #5

Closed michaelpawlus closed 6 years ago

michaelpawlus commented 6 years ago

Here are some files for doing fuzzy matching between the squad data and the fifa data.

I used this the csvdedupe package

I created a config.json file as shown on the github README

Then ran: csvlink portugal_squad.csv portugal_players.csv --config_file=config.json

Just swap in the names for your csv files and edit the config.json (though may not be necessary) if you want to try with a different squad.

In Portugal tests, it caught everyone except José Fonte. I might try to see if I can get it to catch that or else we might have to do a few manual joins but this could merge 95% of cases hopefully.

joemgon commented 6 years ago

Thanks Mike, reviewing now