weso / hercules-ontology

Development of the Ontology and its Continuos Integration for the Hercules project.
https://herculescrue.github.io/ib-hercules-ontology/current/asio.html
GNU General Public License v3.0
0 stars 5 forks source link

generate-&-publish 'subject areas' vertical module #95

Closed spitxa closed 4 years ago

spitxa commented 4 years ago

i was given a new document (attached) including the subject areas (áreas temáticas) used by the agencia estatal de investigación (depending on the ministerio de econonomía, insdustria y competitividad). these subject areas are slightly different from the scientific domains (although related) and are mandatory (through descriptors) for any business with agencia estatal de investigación (projects, subsidies, financial helps, etc.).

a new vertical module, also SKOS-based is to be produced to enrich the ontological model.

areas_tematicas_AEI.pdf

spitxa commented 4 years ago

first data-preparation manœuvres to be done: -analyse the source document to establish levels, descriptors by level, labels, textual descriptions, etc. -export data from the source document (pdf) to tabular data into the separate levels discovered -translate the spanish labels in (at least) english and also official languages (and non official)

following steps: -code the tabels programs -generate the rdf datasets in SKOS

spitxa commented 4 years ago

done: -exported data from the source document to tabular data -coded tabels program to transform the data -label localisation into an, ast, ca, en, es, eu, ext, gl, fr, oc, pt, ca-ipa, es-ipa, en-gb-ipa, en-us-ipa -generated & added preliminary version of the subject-areas vertical module, including so far just the 1st level.

to be done: -still working on the 2nd & 3rd levels

spitxa commented 4 years ago

done: -exported data from the source document to tabular data -coded tabels program to transform the data -label localisation into ast, ca, es, en, gl, fr and pt -generated & added preliminary version of the subject-areas vertical module, including already the 1st and 2nd levels.

to be done: -still working on the last level: 3rd

spitxa commented 4 years ago

done: -exported data from the source document to tabular data -label localisation into ca, es, en, gl, fr and pt

to be done: -disambiguate codes (see attached screenshot) belonging to different levels that are identical

Captura de Pantalla 2020-04-22 a les 16 22 20

-code tabels program to transform the data -label localisation into ast -generate 3rd-level dataset -merge 3rd-level to 1-st & 2nd levels and update vertical module

spitxa commented 4 years ago

done: -disambiguated codes (see attached screenshot) belonging to different levels that were identical: Captura de Pantalla 2020-04-23 a les 15 53 10 (a digit 3 was added instead of the third letter of the codes)

-coded tabels program to transform the data -localised label in ast -generated 3rd-level dataset -merged 3rd-level to 1-st & 2nd levels and update vertical module

spitxa commented 4 years ago

all committed and running.