jvalhondo / spanish-names-surnames

Data set with Spanish names and surnames

Data has NO accents!! #1

Open cach-dies opened 4 years ago

cach-dies commented 4 years ago

This dataset is great; however, it has two very large deficiencies.

  1. The data has no "Ñ" or "Ç", meaning all names containing these letters are shown incorrectly.

  2. All accented vowels have been replaced by their unaccented counterparts (e.g. Á shows as A).

While this dataset is great, these two deficiencies (especially the second one) greatly reduce its usefulness. I understand the second issue comes from the source data itself, as the INE data has no accents. If this could be fixed, or if anyone knows of a dataset without this issue, please respond to this post. Thanks!
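To illustrate why this is hard to fix after the fact, here is a minimal Python sketch of accent stripping. It is not necessarily how the INE data was produced; the function name `strip_accents` and the sample names are only illustrative assumptions. It shows that the mapping is one-way: once the marks are removed, the accented and unaccented spellings are indistinguishable.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Remove combining marks (accents, tilde, cedilla) from text.

    Hypothetical helper for illustration only.
    """
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Keep only the characters that are not combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Sample names chosen for illustration; they are not taken from the dataset.
for name in ["JOSÉ", "MUÑOZ", "MARÍA"]:
    print(name, "->", strip_accents(name))

# Both spellings collapse to the same string, so the original form
# cannot be recovered from the stripped one without extra information.
print(strip_accents("MARÍA") == strip_accents("MARIA"))
```

Restoring the accents would therefore need a second source that still has them, since nothing in the stripped string says where they were.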

mnieves79 commented 5 months ago

I prefer this data set without accented vowels or special characters. I am adding this to my personal genealogy database in MySQL and special characters tend to cause problems when querying DB tables.
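For anyone who wants both behaviours, one possible approach is to keep the accented spelling and look names up through an accent-folded key. The sketch below is plain Python, not MySQL, and the names `fold` and `build_index` as well as the sample data are hypothetical.

```python
import unicodedata
from collections import defaultdict


def fold(text: str) -> str:
    """Return an accent-free, case-insensitive lookup key (hypothetical helper)."""
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def build_index(names):
    """Map each folded key to the original spellings that share it."""
    index = defaultdict(list)
    for name in names:
        index[fold(name)].append(name)
    return index


# Illustrative data only; not taken from the dataset.
index = build_index(["María", "Begoña", "Ángel"])

# Queries typed without accents still find the accented originals.
print(index[fold("MARIA")])
print(index[fold("begona")])
```

The stored value keeps its accents, while queries only ever touch the folded key, so unaccented input still matches.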