AtlasOfLivingAustralia / DataQuality

Data Quality
0 stars 0 forks source link

Identify lower quality records... #245

Open M-Nicholls opened 2 years ago

M-Nicholls commented 2 years ago

... so that the data set that is trusted

Identify and filter incomplete records:

https://github.com/AtlasOfLivingAustralia/DataQuality/issues/249

Identify and filter invalid records:

Values are

Identify and filter potentially incorrect records:

Automated identification of incorrect records:

Manually identify and filter incorrect records

Identify and filter duplicate records

Identify and filter less authoritative records

Identify not fit for purpose records

M-Nicholls commented 2 years ago

For each filter needed establish:

wherever an assertion is used to filter, check that that the assertion is operating correctly e.g. https://biocache.ala.org.au/occurrences/search?q=assertions:RECORDED_DATE_INVALID

many of the records appear to have valid dates: https://biocache.ala.org.au/occurrences/4eaa0bd1-5bb0-4e40-9452-31e8afb0a040