INTERSTAT / Statistics-Contextualized

Models for the dissemination of contextualized statistical data
6 stars 3 forks source link

Air Quality ontology and data models #21

Open pafrance opened 2 years ago

pafrance commented 2 years ago

This is a proposal to try model Air quality using existing vocabularies from SOSA for sensor description and AQD model for Air pollution interstat.pdf Yellow is related to SOSA concepts and green is related to AQD model. Bear in mind that this is the ontological description of the domain of interest regarding Air Pollution This model can be exported in OWL format with eddy.

Actual Data can be mapped by tools like monolith or juma, but some adjustments are needed to match with suggested smart model data structures The link contains a list of properties and concepts that have been analyzed to solve the compatibility issue Items highlighted in green have been added in the graphical representation while the yellow ones are proposed for revision. Not highlighted Item have not been analyzed yet. Set of concepts like these pertains to the administrative elements of the sensor or to its physical environment that could be added to the concept model too. lista di concetti interstat.docx

Except from the missing areas regarding sensor physical environment, the main mismatch in the data models is about the pollutant structure: sensor data exhibits a vector of pollutant measurements that can be mapped to a given set of columns in a tabular representation, each of which represents concentration data, and thus is formatted as a float. Our model represents a single observation as a couple of key/value set, so multiple measurements translate to multiple rows pertaining to the same observation Is it possible to translate between the two models with a simple pivot/unpivot function.

After we reach consensus on a common data model, the next step is about mapping this model to actual data sources to produce the triplets for each source, but I'd like to discuss available datasets and the common model first. Italian datasets, which are already compliant to AQD and SOSA models are available for reference.

albertoabellagarcia commented 2 years ago

I guess that we are mapping 'different' things because individual measurements possibly are better mapped by the deviceMeasurement data model. Airquality data models reflect a 'full' picture.