nestauk / dap_aria_mapping

Mapping technology innovation to support The Advanced Research and Innovation Agency (ARIA)
MIT License
1 stars 0 forks source link

9 semantic taxonomy #36

Closed ampudia19 closed 1 year ago

ampudia19 commented 1 year ago

Semantic Taxonomy

Description

This PR includes all require code and utils to conduct one of several semantic clustering approaches, namely:

Single clustering outputs are created from make_taxonomy.py, which multiple clustering outputs (and the associated meta_cluster object) are created using make_taxonomies.py (which does not have argparse properties). See README in pipeline/semantic_taxonomy for more details.

Pipelines for embeddings generation are also included in this PR.

Fixes #9

Checklist: