inbo / natuurpunt-natagora-checklist

Waarnemingen.be / observations.be - List of species observed in Belgium
MIT License
0 stars 0 forks source link

Export files do not use UTF-8 encodings #5

Open peterdesmet opened 1 month ago

peterdesmet commented 1 month ago

The datasetName is:

Waarnemingen.be /�observations.be�- List of species observed in Belgium

Is it possible to replace this value in the SQL with regular spaces?

And some scientificName have:

Epichlo� baconii

This seems to be caused by the fact that the data has the ISO-8859-1 encoding rather than the UTF-8 encoding. When exporting the csv file from the database, please define the encoding as UTF-8.

peterdesmet commented 1 month ago

Epichloë baconii is now written correctly, but the datasetName still has characters around observations.be that are not spaces. Notice the dots (for spaces) vs the no-dots around observations.be. I can correct this manually, but would be good to resolve this at the source:

Screenshot 2024-08-02 at 14 32 31