nestauk / dap_aria_mapping

Mapping technology innovation to support The Advanced Research and Innovation Agency (ARIA)
MIT License
1 stars 0 forks source link

[64] Pipeline script for topic-level novelty #66

Closed beingkk closed 1 year ago

beingkk commented 1 year ago

Closes #64

The main addition is the pipeline for calculating a topic-level novelty score, characterising the "uncommonness" of the research related to a given topic (in a given year).

The usage is as follows:

python dap_aria_mapping/pipeline/novelty/openalex_topic_novelty.py

This will output five tables (on per each taxonomy level), with two alternative novelty scores per topic, per year.

At the moment, this is based only on the OpenAlex data.

In a forthcoming issue/PR, I will apply the same analysis on patent data, to generate novelty scores using patents as well.

Checklist:

beingkk commented 1 year ago

Thanks a lot @ampudia19, will implement your suggestions!

beingkk commented 1 year ago

Thanks again @ampudia19, I fixed the issues highlighted above, namely:

beingkk commented 1 year ago

Hope this is OK to merge @ampudia19 ?

Re topic names: I'm happy to add an adjustment to use chatgpt topic names via another issue #71 (perhaps once you've merged the corresponding PR). Hope that's alright?

ampudia19 commented 1 year ago

All is looking good, Karlis :)