nestauk / dap_aria_mapping

Mapping technology innovation to support The Advanced Research and Innovation Agency (ARIA)
MIT License

10 cooccurrence taxonomy #33

Closed emily-bicks closed 1 year ago

emily-bicks commented 1 year ago

Description

Builds a pipeline that generates a taxonomy from term co-occurrences.

Fixes #11
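For reviewers unfamiliar with the idea, here is a minimal sketch of what counting term co-occurrences can look like. It is not the code in this PR: the function name `count_cooccurrences`, the `{document_id: [terms]}` input shape, and the toy documents are all assumptions made for illustration. Two terms are treated as co-occurring when they appear in the same document.

```python
from collections import Counter
from itertools import combinations


def count_cooccurrences(doc_terms):
    """Count how many documents each unordered pair of terms shares.

    doc_terms: dict mapping a document id to a list of terms
    (hypothetical input shape, not taken from the PR).
    """
    pair_counts = Counter()
    for terms in doc_terms.values():
        # Deduplicate and sort so each pair is counted once per document
        # and always appears in the same (alphabetical) order.
        unique_terms = sorted(set(terms))
        pair_counts.update(combinations(unique_terms, 2))
    return pair_counts


# Toy data for illustration only.
docs = {
    "doc_1": ["graphene", "battery", "anode"],
    "doc_2": ["graphene", "battery"],
    "doc_3": ["battery", "anode", "battery"],
}
pair_counts = count_cooccurrences(docs)
```

Pair counts like these can be read as edge weights in a term network, which is one common starting point for grouping terms into a hierarchy. How this PR actually turns co-occurrences into taxonomy levels is defined in pipeline/.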

Instructions for Reviewer

To test: run the scripts with the --test flag, which runs them on a small sample of the dataset. NOTE: if test_mode is not used, the script takes a long time to run and overwrites the outputs on AWS.
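As a rough illustration of the behaviour described above, a `--test` flag can be wired up along these lines. This is a sketch under assumptions, not the scripts in this PR: `parse_args`, `select_documents`, `sample_size`, and `seed` are hypothetical names, and skipping the save step in test mode is an assumption inferred from the note about overwriting outputs.

```python
import argparse
import random


def parse_args(argv=None):
    """Parse command-line arguments (hypothetical script interface)."""
    parser = argparse.ArgumentParser(description="Hypothetical pipeline script")
    parser.add_argument(
        "--test",
        action="store_true",
        help="Run on a small sample of the dataset",
    )
    return parser.parse_args(argv)


def select_documents(documents, test_mode, sample_size=100, seed=42):
    """Return all documents, or a reproducible random sample in test mode."""
    if not test_mode:
        return documents
    rng = random.Random(seed)  # fixed seed so test runs are repeatable
    keys = sorted(documents)
    sampled_keys = rng.sample(keys, min(sample_size, len(keys)))
    return {key: documents[key] for key in sampled_keys}


# Toy data for illustration only.
all_docs = {f"doc_{i}": ["term_a", "term_b"] for i in range(500)}
args = parse_args(["--test"])
docs = select_documents(all_docs, test_mode=args.test)

# Assumption: only a full run would write results back to shared storage.
should_save = not args.test
```

Guarding the save step on the flag keeps a quick local check from touching the shared outputs.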

Please pay special attention to: pipeline/

SOME THINGS TO NOTE WITH TAXONOMY THAT MAY REQUIRE ITERATION:

There are also new getters:

Checklist: