Closed: Aariq closed this issue 2 years ago
This is a great example for class. Every possible version is in there. My name has an accent in Portuguese but not in Spanish, so sometimes it was typed in with the accent and sometimes without. All combinations are there for names that do have accents (Ze, Ze'), plus the usual array of misspelled names (Phil and Phill). The record holder may be "Joao Deus, Juan De Deis, Juan De Dios, and Juan de Deus"... none of which are his name (João de Deus). To say nothing of the nicknames: Carlos is from Peru, so of course he was immediately nicknamed Machu Pichu, which was quickly shortened to Machu. All three are there (Carlos, Machu, and Machu Pichu). I'm dying.
Leave it the way it is. Maybe one day I'll take a stroll down memory lane and OpenRefine them. Just add a note that they need to be cleaned.
There are likely a lot of duplicated observer names due to typos. I don't know if this is worth trying to fix, but it might be worth documenting.
Created on 2021-04-07 by the reprex package (v0.3.0)
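If someone does want to document the likely duplicates discussed in this thread, a rough first pass could look like the sketch below. It is only an illustration, loosely modeled on the key-collision ("fingerprint") idea used by OpenRefine: strip accents and punctuation, lowercase, and sort the tokens, then group names that share a key. The function names `fingerprint` and `group_variants` and the sample names are hypothetical, not taken from the actual dataset or codebase.

```python
import unicodedata
from collections import defaultdict


def fingerprint(name: str) -> str:
    """Build a normalized key: no accents, no punctuation, lowercase, sorted tokens."""
    # Decompose accented characters, then drop the combining marks (ã -> a).
    decomposed = unicodedata.normalize("NFKD", name)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Replace anything that is not a letter, digit, or whitespace with a space.
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() else " " for ch in no_accents.lower()
    )
    # Sort and de-duplicate tokens so word order does not matter.
    return " ".join(sorted(set(cleaned.split())))


def group_variants(names):
    """Return only the keys shared by more than one distinct spelling."""
    groups = defaultdict(set)
    for name in names:
        groups[fingerprint(name)].add(name)
    return {key: sorted(variants) for key, variants in groups.items() if len(variants) > 1}


# Hypothetical sample modeled on the examples mentioned in the thread.
observers = ["João de Deus", "Joao De Deus", "Ze", "Ze'", "Phil", "Phill"]
duplicates = group_variants(observers)
for key, variants in duplicates.items():
    print(key, "->", variants)
```

This only catches differences in accents, case, punctuation, and word order. True misspellings (Phil vs. Phill) and nicknames (Carlos vs. Machu) would still need fuzzy matching or a hand-made lookup table, so the output is best treated as a list of candidates to review, not an automatic fix.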