Open LienReyserhove opened 5 years ago
Some intensive cleaning is needed for the dates.
On request of @stijnvanhoey and @damianooldoni , I made a branch eventDate_mapping
. The file can be found here
It appears to be empty, but it's not :-)
I will try to figure out how to clean them myself, but help is always welcome (and already provided thanks to @stijnvanhoey)
I suspect this part of DAISIE was not well reviewed when being collated. I think we need some rules to exclude data that does not make sense. I suggest at least the following:
Given the long discussions we've had about scope I don't suppose these records can be left out altogether. I really hate unbounded records. They tend to get interpreted as the taxon always being present, whereas frequently the reverse is true. If you feel you have to be included them, then you could use the occurrenceStatus of doubtful, which seems appropriate in these cases.
When inspecting date information, the following dates are odd:
start_year
for 174 records and starting from the year -7000 (!)end_year
for 124 records and starting form the year -6000With respect to 1 and 2, I can hardly imagine these species to be alien as the are introduced many many years ago. I would suggest to leave eventDate information empty for the records in 1, 2 and 3. This only affects about 200 distributions (a total of 56000 distributions = 0.35%)