Open BobSimons opened 3 years ago
Thanks. Just adding a note here that the encoding issues listed here are present in the UTF-8 encoded data files on IPT, so the issue will need to be fixed at the data provider or node level. The respective nodes have been notified and I'll try to come up with a report to help the node managers identify issues.
In the 2021-05-18 occurrence.csv file, there are a large number of probably incorrect character sequences and invalid characters. I'll just guess that these stem from the characterSet being incorrectly set/handled when you imported the data from the source. Here are 3 examples from when scientificName="Ablennes hians":
The data for many other scientificNames has similar problems. Can you please track down and fix these data problems? Thank you.