datasets / airport-codes

List of Airport codes, locations and other information around the world
https://datahub.io/core/airport-codes
306 stars 93 forks source link

Most recent datahub.io published version has encoding errors in CSV #37

Open joeshaw opened 3 years ago

joeshaw commented 3 years ago

The most recent CSV file available on https://datahub.io/core/airport-codes (from ~6 months ago) has unicode encoding errors.

One example, from a diff of an older version to the latest one:

-MMCC,small_airport,Ciudad Acuña New International Airport,1410,NA,MX,MX-COA,Ciudad Acuña,MMCC,ACN,EFN,"29.332899, -101.098998"
+MMCC,small_airport,Ciudad Acuña New International Airport,1410,NA,MX,MX-COA,Ciudad Acuña,MMCC,ACN,EFN,"-101.098998, 29.332899"

Looks like the source input was treated as Latin-1 rather than UTF-8?

luisdelarosa commented 1 year ago

I just ran the the process script on an M1 Mac and the row looks like this after the script completed:

MMCC,small_airport,Ciudad Acuña New International Airport,1410,NA,MX,MX-COA,Ciudad Acuña,MMCC,ACN,EFN,"29.333662, -101.100555"