AtlasOfLivingAustralia / ala-downloads

Data downloads
https://downloads.ala.org.au

Pseudo-replicated fields in DwC download (again) #28

Open Mesibov opened 5 years ago

Mesibov commented 5 years ago

(See also https://github.com/AtlasOfLivingAustralia/ala-downloads/issues/17) In the Full Darwin Core download I did yesterday (https://doi.org/10.26197/5bd2c5b89be4a) there are 6 pseudo-duplicated fields. Below are their fieldnames, descriptions from headings.csv and comments.


scientificName scientificName

"scientificName","raw_taxon_name","scientificName","raw_taxon_name","The full scientific name, with authorship and date information if known. When forming part of an Identification, this should be the name in lowest level taxonomic rank that can be determined. This term should not contain identification qualifications, which should instead be supplied in the IdentificationQualifier term.","raw_taxon_name","Scientific Name - original","http://rs.tdwg.org/dwc/terms/scientificName"
"scientificName","taxon_name","scientificName","taxon_name","The full scientific name, with authorship and date information if known. When forming part of an Identification, this should be the name in lowest level taxonomic rank that can be determined. This term should not contain identification qualifications, which should instead be supplied in the IdentificationQualifier term.","taxon_name","Scientific Name","http://rs.tdwg.org/dwc/terms/scientificName"

One field has (mainly) names with authors. The other has bare names, but not just canonicalName, because many entries are blank.

--

establishmentMeans establishmentMeans

"establishmentMeans","cultivated","establishmentMeans","cultivated","The process by which the biological individual(s) represented in the Occurrence became established at the location. Recommended best practice is to use a controlled vocabulary.","cultivated","","http://rs.tdwg.org/dwc/terms/establishmentMeans"
"establishmentMeans","establishment_means","establishmentMeans","establishment_means","The process by which the biological individual(s) represented in the Occurrence became established at the location. Recommended best practice is to use a controlled vocabulary.","establishment_means","","http://rs.tdwg.org/dwc/terms/establishmentMeans"

The 2 fields have differing content (value counts for each field):

18368 (blank)
1649 Cultivated
6 Possibly cultivated
104 Presumably cultivated

18040 (blank)
1973 cultivated
4 native; cultivated
1 native; possibly cultivated
5 possibly cultivated
104 presumably cultivated
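
Value counts like these can be reproduced with a short script. The following is only a sketch: it assumes the occurrence file is comma-separated with a single header row, and the function name tally_duplicate_columns and the sample rows are made up for illustration. Columns are addressed by position, because a repeated header means a reader keyed on column name (such as csv.DictReader) would keep only one of the two columns.

```python
import csv
import io
from collections import Counter


def tally_duplicate_columns(csv_file, header_name):
    """Count the values in every column whose header equals header_name.

    Returns a dict mapping column position -> Counter of values.
    """
    reader = csv.reader(csv_file)
    header = next(reader)
    positions = [i for i, name in enumerate(header) if name == header_name]
    tallies = {i: Counter() for i in positions}
    for row in reader:
        for i in positions:
            # Treat a short row as having a blank value in this column.
            value = row[i] if i < len(row) else ""
            tallies[i][value] += 1
    return tallies


# Made-up sample rows, not taken from the actual download.
sample = io.StringIO(
    "occurrenceID,establishmentMeans,establishmentMeans\n"
    "a,Cultivated,cultivated\n"
    "b,,cultivated\n"
    "c,,\n"
)
result = tally_duplicate_columns(sample, "establishmentMeans")
for position, counts in result.items():
    print(position, dict(counts))
```

With a real download, the StringIO sample would be replaced by the opened occurrence file (opened with newline="" as the csv module recommends).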

--

verbatimLatitude verbatimLatitude

"verbatimLatitude","raw_latitude","verbatimLatitude","raw_latitude","The verbatim original latitude of the Location. The coordinate ellipsoid, geodeticDatum, or full Spatial Reference System (SRS) for these coordinates should be stored in verbatimSRS and the coordinate system should be stored in verbatimCoordinateSystem.","raw_latitude","Latitude - original","http://rs.tdwg.org/dwc/terms/verbatimLatitude"
"verbatimLatitude","verbatim_latitude","verbatimLatitude","verbatim_latitude","The verbatim original latitude of the Location. The coordinate ellipsoid, geodeticDatum, or full Spatial Reference System (SRS) for these coordinates should be stored in verbatimSRS and the coordinate system should be stored in verbatimCoordinateSystem.","verbatim_latitude","","http://rs.tdwg.org/dwc/terms/verbatimLatitude"

One field is in degrees, minutes and seconds (DMS); the other is in decimal degrees (DD).

verbatimLongitude verbatimLongitude

"verbatimLongitude","raw_longitude","verbatimLongitude","raw_longitude","The verbatim original longitude of the Location. The coordinate ellipsoid, geodeticDatum, or full Spatial Reference System (SRS) for these coordinates should be stored in verbatimSRS and the coordinate system should be stored in verbatimCoordinateSystem.","raw_longitude","Longitude - original","http://rs.tdwg.org/dwc/terms/verbatimLongitude"
"verbatimLongitude","verbatim_longitude","verbatimLongitude","verbatim_longitude","The verbatim original longitude of the Location. The coordinate ellipsoid, geodeticDatum, or full Spatial Reference System (SRS) for these coordinates should be stored in verbatimSRS and the coordinate system should be stored in verbatimCoordinateSystem.","verbatim_longitude","","http://rs.tdwg.org/dwc/terms/verbatimLongitude"

One of the pseudo-replicates is in DMS, the other in DD.
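
A quick way to tell which of two same-named coordinate columns holds decimal degrees is to test whether each value parses as a plain number within the valid range. This is a rough sketch only: the exact DMS notation used in the download is not shown here, so the function (a hypothetical helper, looks_like_decimal_degrees) just separates plain decimals from everything else, and the sample strings are invented.

```python
def looks_like_decimal_degrees(text, limit):
    """Return True if text is a plain decimal number with |value| <= limit.

    Use limit=90 for latitudes and limit=180 for longitudes.
    Anything that does not parse as a number (blank, DMS text, etc.)
    returns False.
    """
    try:
        value = float(text.strip())
    except ValueError:
        return False
    # NaN and infinity fail this comparison and are rejected.
    return abs(value) <= limit


# Invented sample values, not taken from the actual download.
samples = ["-41.175", "41 10 30 S", "", "147.5"]
flags = [looks_like_decimal_degrees(s, 180) for s in samples]
print(flags)
```

Applying this to each of the two columns and counting the True results would show which column is the DD one.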

--

maximumDepthInMeters maximumDepthInMeters

"maximumDepthInMeters","max_depth_d","maximumDepthInMeters","max_depth_d","The greater depth of a range of depth below the local surface, in meters.","max_depth_d","Maximum depth in meters","http://rs.tdwg.org/dwc/terms/maximumDepthInMeters"
"maximumDepthInMeters","min_depth_d","maximumDepthInMeters","min_depth_d","The greater depth of a range of depth below the local surface, in meters.","min_depth_d","Minimum depth in meters","http://rs.tdwg.org/dwc/terms/maximumDepthInMeters"

One of these is actually minimumDepthInMeters: the min_depth_d row ("Minimum depth in meters") carries the maximumDepthInMeters name, description and URI.

--

verbatimCoordinateSystem verbatimCoordinateSystem

"verbatimCoordinateSystem","raw_datum","verbatimCoordinateSystem","raw_datum","The spatial coordinate system for the verbatimLatitude and verbatimLongitude or the verbatimCoordinates of the Location. Recommended best practice is to use a controlled vocabulary.","raw_datum","Geodetic datum - original","http://rs.tdwg.org/dwc/terms/verbatimCoordinateSystem"
"verbatimCoordinateSystem","verbatim_coordinate_system","verbatimCoordinateSystem","verbatim_coordinate_system","The spatial coordinate system for the verbatimLatitude and verbatimLongitude or the verbatimCoordinates of the Location. Recommended best practice is to use a controlled vocabulary.","verbatim_coordinate_system","","http://rs.tdwg.org/dwc/terms/verbatimCoordinateSystem"

One of these fields is blank.
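
Pseudo-duplicated fields like the six above can be listed directly from headings.csv. The sketch below assumes, based only on the rows quoted in this issue, that the first column of headings.csv is the column name used in the download and the second is the underlying ALA field name; the function name find_repeated_headings is made up, and the "decimalLatitude","latitude" row in the sample is invented for contrast.

```python
import csv
import io
from collections import defaultdict


def find_repeated_headings(csv_file, name_col=0, field_col=1):
    """Group headings.csv rows by download column name.

    Returns a dict of column name -> list of source field names,
    keeping only names that occur on more than one row.
    """
    groups = defaultdict(list)
    for row in csv.reader(csv_file):
        # Skip rows too short to hold both columns of interest.
        if len(row) > max(name_col, field_col):
            groups[row[name_col]].append(row[field_col])
    return {name: fields for name, fields in groups.items() if len(fields) > 1}


# Abbreviated sample: only the first two columns of each row are kept.
sample = io.StringIO(
    '"maximumDepthInMeters","max_depth_d"\n'
    '"maximumDepthInMeters","min_depth_d"\n'
    '"decimalLatitude","latitude"\n'
)
repeated = find_repeated_headings(sample)
for name, fields in repeated.items():
    print(name, fields)
```

If headings.csv has a header row, it simply forms a group of one and is dropped from the result.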