Refactor of data collection OA code and new GtR modules
Description
In this PR, I've included a major refactor of our data collection code for OpenAlex (OA), and introduced new modules for data collection and processing for Gateway to Research (GtR).
Key Changes
OA Data Collection Code Refactor:
I've restructured the OA data collection code to enhance its modularity and functionality. This includes moving some parts to utils.py.
Introduction of GtR Data Collection and Processing:
New modules for fetching and processing GtR data have been added.
Specific Requests
Jack's Review on OA Code:
Jack, could you please check that the old OA code still works as before, especially the parts I've moved around? I've tried to make sure nothing breaks, but I can't be certain without tests. Adding some tests for your code would help catch this kind of thing in the future.
Additionally, there's some redundancy in the code, particularly between the two similar request functions. It would be great if you could look at yours and see whether your code can use mine instead.
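To make the request about the duplicated request functions more concrete, here is a rough sketch of what a single shared helper could look like. It is not the code in this PR: the name `fetch_json`, its parameters, and the retry and backoff behaviour are all hypothetical, chosen only to illustrate the idea of one function that both the OA and GtR code could call.

```python
import json
import time
import urllib.error
import urllib.request
from typing import Any, Callable


def _http_get(url: str, timeout: float = 30.0) -> bytes:
    """Default transport: a plain GET using only the standard library."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def fetch_json(
    url: str,
    max_retries: int = 3,
    backoff: float = 1.0,
    transport: Callable[[str], bytes] = _http_get,
    sleep: Callable[[float], None] = time.sleep,
) -> Any:
    """Hypothetical shared helper: GET a URL and parse the body as JSON.

    Retries on network or decoding errors, doubling the wait between
    attempts. `transport` and `sleep` are injectable so the function can
    be tested without touching the network or actually waiting.
    """
    last_error = None
    for attempt in range(max_retries):
        try:
            return json.loads(transport(url))
        except (urllib.error.URLError, TimeoutError, json.JSONDecodeError) as error:
            last_error = error
            # Wait before the next attempt, but not after the final one.
            if attempt < max_retries - 1:
                sleep(backoff * 2 ** attempt)
    raise RuntimeError(
        f"Request to {url} failed after {max_retries} attempts"
    ) from last_error
```

Passing the transport in as an argument is one way to make this easy to test: a test can hand in a fake function that fails a couple of times and then returns a canned response, which would also cover the "nothing breaks" worry above.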
Harmonisation of Hierarchical Naming:
I've noticed that the hierarchical naming in both parameters and catalog names could be better aligned. Currently, I've kept yours as top-level, but ideally, they should be nested under <oa>. See the other parameters for reference.
Any suggestions on restructuring these for consistency and clarity are appreciated :)