tdwg / bdq

Biodiversity Data Quality (BDQ) Interest Group
https://github.com/tdwg/bdq

TG2-VALIDATION_TAXON_NOTEMPTY #105

Open iDigBioBot opened 6 years ago

iDigBioBot commented 6 years ago
TestField Value
GUID 06851339-843f-4a43-8422-4e61b9a00e75
Label VALIDATION_TAXON_NOTEMPTY
Description Is there a value in any of the terms needed to determine that the taxon exists?
TestType Validation
Darwin Core Class dwc:Taxon
Information Elements ActedUpon dwc:taxonID, dwc:scientificNameID, dwc:acceptedNameUsageID, dwc:parentNameUsageID, dwc:originalNameUsageID, dwc:taxonConceptID, dwc:scientificName, dwc:higherClassification, dwc:kingdom, dwc:phylum, dwc:class, dwc:order, dwc:superfamily, dwc:family, dwc:subfamily, dwc:tribe, dwc:subtribe, dwc:genus, dwc:genericName, dwc:subgenus, dwc:infragenericEpithet, dwc:specificEpithet, dwc:infraspecificEpithet, dwc:vernacularName, dwc:cultivarEpithet
Information Elements Consulted
Expected Response COMPLIANT if at least one term needed to determine the taxon of the entity exists and is bdq:NotEmpty; otherwise NOT_COMPLIANT
Data Quality Dimension Completeness
Term-Actions TAXON_NOTEMPTY
Parameter(s)
Source Authority
Specification Last Updated 2023-09-18
Examples [dwc:taxonID="", dwc:scientificNameID="", dwc:acceptedNameUsageID="", dwc:parentNameUsageID="", dwc:originalNameUsageID="", dwc:taxonConceptID="", dwc:scientificName="Eucalyptus gunnii", dwc:higherClassification="", dwc:kingdom="", dwc:phylum="", dwc:class="", dwc:order="", dwc:superfamily="", dwc:tribe="", dwc:subtribe="", dwc:family="", dwc:genus="", dwc:subgenus="", dwc:specificEpithet="", dwc:infraspecificEpithet="", dwc:vernacularName="" : Response.status=RUN_HAS_RESULT, Response.result=COMPLIANT, Response.comment="at least enough terms exist that identify that an entity exists"]
[dwc:taxonID="", dwc:scientificNameID="", dwc:acceptedNameUsageID="", dwc:parentNameUsageID="", dwc:originalNameUsageID="", dwc:taxonConceptID="", dwc:scientificName="", dwc:higherClassification="", dwc:kingdom="", dwc:phylum="", dwc:class="", dwc:order="", dwc:superfamily="", dwc:tribe="", dwc:subtribe="", dwc:family="", dwc:genus="", dwc:subgenus="", dwc:specificEpithet="", dwc:infraspecificEpithet="", dwc:vernacularName="": Response.status=RUN_HAS_RESULT, Response.result=NOT_COMPLIANT, Response.comment="All input fields are bdq:Empty or missing"]
Source Lee Belbin
References
Example Implementations (Mechanisms)
Link to Specification Source Code
Notes This tests for records that have no taxonomic (NAME) information. If there is any value for any of the Information Elements, this may be useful information. See example.
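
The Expected Response above can be sketched in a few lines. This is only an illustration, not the reference implementation: the function name, the `Response` tuple, and the comment strings are hypothetical, and `is_empty` assumes that bdq:Empty means "missing, null, or whitespace only", which should be checked against the BDQ vocabulary.

```python
from typing import Mapping, NamedTuple, Optional

# Terms copied from "Information Elements ActedUpon" in this issue.
TAXON_TERMS = (
    "dwc:taxonID", "dwc:scientificNameID", "dwc:acceptedNameUsageID",
    "dwc:parentNameUsageID", "dwc:originalNameUsageID", "dwc:taxonConceptID",
    "dwc:scientificName", "dwc:higherClassification", "dwc:kingdom",
    "dwc:phylum", "dwc:class", "dwc:order", "dwc:superfamily", "dwc:family",
    "dwc:subfamily", "dwc:tribe", "dwc:subtribe", "dwc:genus",
    "dwc:genericName", "dwc:subgenus", "dwc:infragenericEpithet",
    "dwc:specificEpithet", "dwc:infraspecificEpithet", "dwc:vernacularName",
    "dwc:cultivarEpithet",
)


class Response(NamedTuple):
    """Hypothetical container mirroring Response.status/result/comment."""
    status: str
    result: str
    comment: str


def is_empty(value: Optional[str]) -> bool:
    # Assumption: bdq:Empty covers missing, None, and whitespace-only values.
    return value is None or value.strip() == ""


def validation_taxon_notempty(record: Mapping[str, Optional[str]]) -> Response:
    # COMPLIANT as soon as any one of the listed terms carries a value.
    populated = [t for t in TAXON_TERMS if not is_empty(record.get(t))]
    if populated:
        return Response("RUN_HAS_RESULT", "COMPLIANT",
                        f"{len(populated)} taxon term(s) have a value")
    return Response("RUN_HAS_RESULT", "NOT_COMPLIANT",
                    "all taxon terms are empty or missing")
```

Called with `{"dwc:scientificName": "Eucalyptus gunnii"}` this returns a COMPLIANT result, and with an empty record it returns NOT_COMPLIANT, matching the two Examples rows.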

iDigBioBot commented 6 years ago

Comment by Lee Belbin (@Tasilee) migrated from spreadsheet: This needs discussion. For example, would kingdom, phylum, class or order only be useful?

Tasilee commented 2 years ago

Added new Darwin Core taxon terms to Information Elements. Are there any unnecessary ones?

chicoreus commented 2 years ago

Following the definitions of the higher taxonomy terms as placements for the taxon rather than the taxon itself, and as expressed in #57 and #70, the information elements in this test should exclude the higher taxonomy terms and reflect just the list used in #57 and #70: dwc:taxonID, dwc:scientificName, dwc:genericName, dwc:specificEpithet, dwc:infraspecificEpithet, dwc:scientificNameAuthorship and dwc:cultivarEpithet, perhaps also including dwc:taxonConceptID and dwc:scientificNameID. This shorter list contains the terms needed to "determine the taxon of the entity exists"; the others are placements for the taxon or relationships to other taxa.

chicoreus commented 2 years ago

Discussed in a call this summer. The position is analogous to a date with just a day: there isn't sufficient information to know which day is meant, but there is some information that constrains the date. Likewise, even just a value in kingdom (a classification term) is adequate to provide some constraints on the taxon. Thus retain the current list of information elements rather than trying to reduce it.
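
To make the difference between the two positions concrete, here is a small sketch comparing the shorter name-only list with a list that also includes classification terms, for a record holding only a kingdom. The helper `has_value` and the variable names are hypothetical, the classification set is a subset chosen for illustration, and emptiness is assumed to mean missing or whitespace only.

```python
# Shorter list proposed earlier in this thread (name terms only).
NAME_TERMS = {
    "dwc:taxonID", "dwc:scientificName", "dwc:genericName",
    "dwc:specificEpithet", "dwc:infraspecificEpithet",
    "dwc:scientificNameAuthorship", "dwc:cultivarEpithet",
}

# Illustrative subset of the classification (placement) terms.
CLASSIFICATION_TERMS = {
    "dwc:kingdom", "dwc:phylum", "dwc:class", "dwc:order", "dwc:family",
}


def has_value(record, terms):
    # True if any of the given terms is present and not whitespace-only.
    return any((record.get(t) or "").strip() for t in terms)


record = {"dwc:kingdom": "Plantae"}

narrow = has_value(record, NAME_TERMS)
broad = has_value(record, NAME_TERMS | CLASSIFICATION_TERMS)
print(narrow, broad)  # False True
```

Under the retained, broader list a kingdom-only record passes because it constrains the taxon; under the shorter list the same record would fail.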

chicoreus commented 1 year ago

Will need to include the new terms dwc:superfamily, dwc:tribe and dwc:subtribe: https://github.com/tdwg/dwc/issues/65 https://github.com/tdwg/dwc/issues/45 https://github.com/tdwg/dwc/issues/46

Tasilee commented 1 year ago

Added the terms dwc:superfamily, dwc:tribe, dwc:subtribe to the Information elements.

chicoreus commented 1 year ago

Updated Specification Last Updated to 2022-07-04, as the list of information elements has changed.

Tasilee commented 1 year ago

Amended Specification Last Updated from "2022-07-04" to "2023-07-04" :)

Tasilee commented 1 year ago

Splitting bdqffdq:Information Elements into "Information Elements ActedUpon" and "Information Elements Consulted".

Also changed "Field" to "TestField", "Output Type" to "TestType" and updated "Specification Last Updated"