ESIPFed / SOI_harmonization

The data harmonization repository for the ESIP Soil Ontology and Informatics cluster.
MIT License

SOI_harmonization

The data harmonization repository for the ESIP Soil Ontology and Informatics cluster.

Roadmap

1) Describe current soil data products for a set of key measurement types (see Issues)
2) Link measurement names to existing ontologies or extend those ontologies
3) Develop a hierarchical understanding of the methodologies (new ontologies/knowledge maps?)
4) Develop a workflow that combines the proposed data descriptions and a generalized script to generate an integrated data product

Ways to contribute:

1) Identify soil data products for evaluation via https://github.com/ESIPFed/SOI_harmonization/issues/2
2) Describe how layer location (geolocation, depth, and sampling time) is described in the data product via https://github.com/ESIPFed/SOI_harmonization/issues
3) Attend SOI Cluster meetings

Proposed data descriptors

The purpose of these proposed data descriptors is to provide a sufficient description of datasets to enable a general harmonization script to merge them into a common database. We assume the data are presented as a relational database. Taken from https://github.com/ESIPFed/soil_data_model_survey/tree/master/data

1) data_structure: Description of the columns in a database

| data_product | data_table | data_column | data_type |
|---|---|---|---|
| ISCN3 | layer | 13c (‰) | value_number |
| ISCN3 | layer | 14c (‰) | value_number |
| ISCN3 | layer | 14c_sigma (‰) | sigma |
| ISCN3 | layer | 14c_age (BP) | value_number |
| ISCN3 | layer | 14c_age_sigma (BP) | sigma |
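
As a minimal sketch of how a script might consume the data_structure descriptor, the example below reads the rows shown above and groups the columns of one table by their data_type. The tab-separated layout and the helper names `read_descriptor` and `columns_by_type` are assumptions made for illustration; the repository does not prescribe a file format or an API here.

```python
import csv
import io

# Rows copied from the data_structure example above.
# ASSUMPTION: the descriptor is stored as tab-separated text.
DATA_STRUCTURE_TSV = """data_product\tdata_table\tdata_column\tdata_type
ISCN3\tlayer\t13c (‰)\tvalue_number
ISCN3\tlayer\t14c (‰)\tvalue_number
ISCN3\tlayer\t14c_sigma (‰)\tsigma
ISCN3\tlayer\t14c_age (BP)\tvalue_number
ISCN3\tlayer\t14c_age_sigma (BP)\tsigma
"""


def read_descriptor(text):
    """Parse tab-separated descriptor text into a list of dicts (hypothetical helper)."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def columns_by_type(rows, data_product, data_table):
    """Group the column names of one table by their declared data_type."""
    grouped = {}
    for row in rows:
        if row["data_product"] == data_product and row["data_table"] == data_table:
            grouped.setdefault(row["data_type"], []).append(row["data_column"])
    return grouped


structure = read_descriptor(DATA_STRUCTURE_TSV)
grouped = columns_by_type(structure, "ISCN3", "layer")
for data_type, columns in grouped.items():
    print(data_type, columns)
```

Grouping by data_type lets a general script treat every value_number column the same way, and likewise every sigma column, without hard-coding column names for each data product.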

2) data_meta: Descriptions of the metadata that applies to the entire table

| data_product | data_table | data_column | data_type | entry |
|---|---|---|---|---|
| ISCN3 | layer | 13c (‰) | unit | permille |
| ISCN3 | layer | 14c (‰) | unit | permille |
| ISCN3 | layer | 14c_sigma (‰) | unit | permille |
| ISCN3 | layer | 14c_age (BP) | method | 14C age model |
| ISCN3 | layer | 14c_age (BP) | unit | BP |
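
One way a script could use the data_meta descriptor is to index it by column, so that the unit and method of any column can be looked up directly. The sketch below does this for the rows shown above. The tuple layout and the function name `index_meta` are illustrative assumptions, not part of the repository.

```python
# Rows copied from the data_meta example above, as
# (data_product, data_table, data_column, data_type, entry) tuples.
DATA_META = [
    ("ISCN3", "layer", "13c (‰)", "unit", "permille"),
    ("ISCN3", "layer", "14c (‰)", "unit", "permille"),
    ("ISCN3", "layer", "14c_sigma (‰)", "unit", "permille"),
    ("ISCN3", "layer", "14c_age (BP)", "method", "14C age model"),
    ("ISCN3", "layer", "14c_age (BP)", "unit", "BP"),
]


def index_meta(rows):
    """Build {(product, table, column): {data_type: entry}} (hypothetical helper)."""
    index = {}
    for product, table, column, data_type, entry in rows:
        index.setdefault((product, table, column), {})[data_type] = entry
    return index


meta = index_meta(DATA_META)
age_meta = meta[("ISCN3", "layer", "14c_age (BP)")]
print(age_meta["unit"], "/", age_meta["method"])
```

A column can carry several metadata entries (here `14c_age (BP)` has both a method and a unit), which is why the index maps each column to a small dictionary instead of a single value.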

3) thesaurus: Mapping of provided variable names to a common vocabulary

| variable_location | data_product | data_table | provided_variable | variable |
|---|---|---|---|---|
| column_name | ISCN3 | layer | 13c (‰) | 13c |
| column_name | ISCN3 | layer | 14c (‰) | 14c |
| column_name | ISCN3 | layer | 14c_sigma (‰) | 14c |
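
To illustrate how the three descriptors might work together, the sketch below turns one row of a source table into long-format records that use the common vocabulary. It is only a rough outline of a generalized harmonization step under stated assumptions: the function `harmonize_row`, the output record layout, and the values in `sample_row` (including the `layer_name` column) are invented for the example and do not come from ISCN3 or from this repository.

```python
# Descriptor rows copied from the three example tables above.
DATA_STRUCTURE = [
    ("ISCN3", "layer", "13c (‰)", "value_number"),
    ("ISCN3", "layer", "14c (‰)", "value_number"),
    ("ISCN3", "layer", "14c_sigma (‰)", "sigma"),
]
DATA_META = [
    ("ISCN3", "layer", "13c (‰)", "unit", "permille"),
    ("ISCN3", "layer", "14c (‰)", "unit", "permille"),
    ("ISCN3", "layer", "14c_sigma (‰)", "unit", "permille"),
]
THESAURUS = [
    ("column_name", "ISCN3", "layer", "13c (‰)", "13c"),
    ("column_name", "ISCN3", "layer", "14c (‰)", "14c"),
    ("column_name", "ISCN3", "layer", "14c_sigma (‰)", "14c"),
]


def harmonize_row(raw_row, data_product, data_table):
    """Convert one source row into long-format records (hypothetical helper)."""
    key = (data_product, data_table)
    types = {c: t for p, tb, c, t in DATA_STRUCTURE if (p, tb) == key}
    units = {c: e for p, tb, c, t, e in DATA_META
             if (p, tb) == key and t == "unit"}
    names = {pv: v for loc, p, tb, pv, v in THESAURUS
             if loc == "column_name" and (p, tb) == key}
    records = []
    for column, value in raw_row.items():
        if column not in types:
            continue  # columns without a data_structure entry are skipped
        records.append({
            "variable": names.get(column, column),  # fall back to the source name
            "data_type": types[column],
            "value": value,
            "unit": units.get(column),
        })
    return records


# ASSUMPTION: invented values, used only to exercise the function.
sample_row = {"layer_name": "L1", "13c (‰)": "-27.1",
              "14c (‰)": "-52.3", "14c_sigma (‰)": "4.0"}
records = harmonize_row(sample_row, "ISCN3", "layer")
for record in records:
    print(record)
```

Because `14c (‰)` and `14c_sigma (‰)` both map to the variable `14c`, the data_type field is what distinguishes the measured value from its uncertainty in the harmonized output.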