nestauk / dap_aria_mapping

Mapping technology innovation to support The Advanced Research and Innovation Agency (ARIA)
MIT License
1 stars 0 forks source link

12 validate taxonomy #35

Closed ampudia19 closed 1 year ago

ampudia19 commented 1 year ago

Taxonomy validation (Phase 1)

Description

The first set of validation activities includes visualisation tools as well as two approaches to labelling topics (entity labelling and weighted journal labelling). Plots include sunbursts and a first set of histograms. For details, see the README file in pipeline/taxonomy_validation.

Fixes #12

Checklist:

ampudia19 commented 1 year ago

In regards to the validation utils, I completely agree that it eventually needs splitting. In fact, most functions there correspond to the labelling approaches (and using it to label our taxonomy with entities or journals). I'll rename it to "labelling.py", and create other py files as per your request for subsequent work, if necessary.

ampudia19 commented 1 year ago

@emily-bicks I pushed some additional code (relating to the histograms, mostly). I was thinking of directly merging to dev since you already reviewed its legacy code, but maybe you want to have a quick peak again. Otherwise I can merge :)