LOD-GEOSS / databus-snippets

Annotation possibilities #5

Open 0UmfHxcvx5J7JoaOhFSs5mncnisTJJ6q opened 2 years ago

0UmfHxcvx5J7JoaOhFSs5mncnisTJJ6q commented 2 years ago

We (@giannou and I) looked at how table-metadata-mapping/out.csv is linked to the OEO in oep_metadata/turbine_metadata.jsonld. But the wide format (one column per variable) is … awkward for us. For one thing, we have many variables: the data for the Fifth Assessment Report had 148, that for the Special Report on 1.5°C had 598, and our model currently reports 2267. For another, people who already work with IAMC-style data are used to different formats, and they would need to change their behaviour in some way to adopt the Databus. We would like to minimise the need for the former to maximise the chances of the latter.

So, is it possible to annotate data by row, not by column? For the example data below, what we would ideally have is a file linking e.g. the variable Primary Energy|Gas|w/ CCS to the appropriate class in the OEO.

model,scenario,region,variable,unit,period,value
WITCH-GLOBIOM 3.1,SSP2-26,R5LAM,Primary Energy|Gas|w/ CCS,EJ/yr,2010,0
REMIND-MAgPIE 1.7-3.0,CD-LINKS_NPi2020_400,World,AR5 climate diagnostics|Temperature|Global Mean|MAGICC6|MED,°C,2056,1.64109
IMAGE 3.0.2,EMF33_WB2C_nobeccs,R5REF,Primary Energy|Gas|w/ CCS,EJ/yr,2000,0
REMIND-MAgPIE 1.7-3.0,SMP_1p5C_Def,R5MAF,Emissions|CO2|Energy and Industrial Processes,Mt CO2/yr,2035,2704.4925
IMAGE 3.0.1,IMA15-RenElec,R5LAM,Investment|Energy Supply|Electricity|Coal|w/o CCS,billion US$2010/yr,2070,0
POLES ADVANCE,ADVANCE_2020_Med2C,World,AR5 climate diagnostics|Forcing|Tropospheric Ozone|MAGICC6|MED,W/m2,2041,0.363274
WITCH-GLOBIOM 3.1,SSP4-Baseline,World,Secondary Energy|Electricity|Biomass|w/ CCS,EJ/yr,2070,0.00016
WITCH-GLOBIOM 4.2,ADVANCE_INDC,R5OECD90+EU,Secondary Energy|Electricity|Coal|w/o CCS,EJ/yr,2090,0
C-ROADS-5.005,Ratchet-1.5-allCDR,World,AR5 climate diagnostics|Temperature|Global Mean|FAIR|P90|Non CO2,°C,2097,0.388
WITCH-GLOBIOM 4.2,ADVANCE_Reference,R5MAF,Secondary Energy|Electricity|Coal|w/ CCS,EJ/yr,2005,0.000001
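
To make the request concrete, here is a rough sketch of what such a row-level link file could look like and how it might be applied to long-format data. Everything in it is an assumption for illustration: the two-column layout (`variable`, `oeo_class`), the helper names `load_links` and `annotate`, and in particular the OEO identifiers, which are placeholders and not real OEO classes. It is not something the Databus supports today.

```python
import csv
import io

# Hypothetical link file: one line per IAMC variable instead of one per column.
# The oeo_class values are placeholders, NOT real OEO identifiers.
LINKS_CSV = """\
variable,oeo_class
Primary Energy|Gas|w/ CCS,OEO_PLACEHOLDER_GAS_CCS
Secondary Energy|Electricity|Coal|w/o CCS,OEO_PLACEHOLDER_COAL_NO_CCS
"""

# Two rows taken from the example data, kept in long (IAMC-style) format.
ROWS = [
    {"model": "WITCH-GLOBIOM 3.1", "scenario": "SSP2-26", "region": "R5LAM",
     "variable": "Primary Energy|Gas|w/ CCS", "unit": "EJ/yr",
     "period": 2010, "value": 0.0},
    {"model": "REMIND-MAgPIE 1.7-3.0", "scenario": "SMP_1p5C_Def", "region": "R5MAF",
     "variable": "Emissions|CO2|Energy and Industrial Processes", "unit": "Mt CO2/yr",
     "period": 2035, "value": 2704.4925},
]


def load_links(text):
    """Read the link file into a dict: variable name -> OEO class."""
    reader = csv.DictReader(io.StringIO(text))
    return {row["variable"]: row["oeo_class"] for row in reader}


def annotate(rows, links):
    """Return copies of the rows with an extra 'oeo_class' key.

    Variables without an entry in the link file get None.
    """
    return [dict(row, oeo_class=links.get(row["variable"])) for row in rows]


if __name__ == "__main__":
    for row in annotate(ROWS, load_links(LINKS_CSV)):
        print(row["variable"], "->", row["oeo_class"])
```

The point of the sketch is that the link file grows with the number of distinct variables, not with the number of columns, so the data itself could stay in the long format people already use.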
yum-yab commented 2 years ago

This is currently not possible. The reason is:

There needs to be a mapping from variable to OEO class somewhere. Currently this lives in a modified version of the datasets' metadata. An example can be seen here. For example, the column manufacturer (line 285) is connected to the OEO class OEO:00000045. The table metadata only describes the columns, not their content, so the data in the rows is not part of it. With the current implementation there is therefore no way to annotate rows, only columns.
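
A simplified sketch of the limitation, assuming a stripped-down metadata structure: the `fields` list, the `oeo_class` key and the `class_for` helper are hypothetical stand-ins and not the actual OEP metadata schema. Only the manufacturer column and OEO:00000045 come from the example mentioned above.

```python
# Simplified stand-in for column-level table metadata (not the real schema).
TABLE_METADATA = {
    "fields": [
        # Column-level annotation, as for the manufacturer column.
        {"name": "manufacturer", "oeo_class": "OEO:00000045"},
        # In a long-format table, "variable" is just one more column;
        # the values stored in it never appear in the metadata.
        {"name": "variable", "oeo_class": None},
    ]
}


def class_for(name, metadata):
    """Look up the OEO class annotated on a column, or None if there is none."""
    for field in metadata["fields"]:
        if field["name"] == name:
            return field["oeo_class"]
    return None


# A column name can be resolved ...
print(class_for("manufacturer", TABLE_METADATA))               # OEO:00000045
# ... but a row value such as an IAMC variable name cannot.
print(class_for("Primary Energy|Gas|w/ CCS", TABLE_METADATA))  # None
```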

0UmfHxcvx5J7JoaOhFSs5mncnisTJJ6q commented 2 years ago

This is currently not possible.

So it will be in the future? Is anybody working on that?

Also, finally an answer after two years ;).