Closed ampudia19 closed 1 year ago
In regards to the validation utils, I completely agree that it eventually needs splitting. In fact, most functions there correspond to the labelling approaches (and using it to label our taxonomy with entities or journals). I'll rename it to "labelling.py", and create other py files as per your request for subsequent work, if necessary.
@emily-bicks I pushed some additional code (relating to the histograms, mostly). I was thinking of directly merging to dev since you already reviewed its legacy code, but maybe you want to have a quick peak again. Otherwise I can merge :)
Taxonomy validation (Phase 1)
Description
The first set of validation activities includes visualisation tools as well as two approaches to labelling topics (entity labelling and weighted journal labelling). Plots include sunbursts and a first set of histograms. For details, see the README file in
pipeline/taxonomy_validation
.Fixes #12
Checklist:
notebooks/
pre-commit
and addressed any issues not automatically fixeddev
README
s