Open M-Nicholls opened 3 years ago
For each filter needed establish:
wherever an assertion is used to filter, check that that the assertion is operating correctly e.g. https://biocache.ala.org.au/occurrences/search?q=assertions:RECORDED_DATE_INVALID
many of the records appear to have valid dates: https://biocache.ala.org.au/occurrences/4eaa0bd1-5bb0-4e40-9452-31e8afb0a040
... so that the data set that is trusted
Identify and filter incomplete records:
https://github.com/AtlasOfLivingAustralia/DataQuality/issues/249
Identify and filter invalid records:
Values are
Identify and filter potentially incorrect records:
Automated identification of incorrect records:
spatial outlier detection
environmental outlier detection https://github.com/AtlasOfLivingAustralia/DataQuality/issues/250
taxonomic outlier detection (scientific names not in the names lists) https://github.com/AtlasOfLivingAustralia/DataQuality/issues/252
Manually identify and filter incorrect records
Identify and filter duplicate records
Identify and filter less authoritative records
Identify not fit for purpose records