Closed jaklinger closed 4 years ago
Towards addressing #326
What NIH doesn't tell you is that the core_project_num field contains a "base" project number, which is effectively the actual core project number. Aggregating projects on this base number is essential; otherwise the data will appear to contain many duplicates.
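To make the aggregation idea concrete, here is a minimal sketch, not the code in this PR. It assumes, purely for illustration, that the base number is whatever remains after stripping a trailing "-<suffix>" from core_project_num. The actual rule used by nih_impute_base_id may differ, and the function names and example IDs below are hypothetical.

```python
import re
from collections import defaultdict

# ASSUMPTION (illustration only): the "base" project number is the value of
# core_project_num with one trailing "-<alphanumeric suffix>" removed.
# The real rule implemented in nih_impute_base_id may be different.
SUFFIX_PATTERN = re.compile(r"-[A-Za-z0-9]+$")


def impute_base_id(core_project_num):
    """Return the hypothetical base id for a core_project_num value."""
    if core_project_num is None:
        return None
    return SUFFIX_PATTERN.sub("", core_project_num.strip())


def group_by_base_id(core_project_nums):
    """Group raw core_project_num values under their imputed base id."""
    groups = defaultdict(list)
    for num in core_project_nums:
        groups[impute_base_id(num)].append(num)
    return dict(groups)


# Made-up identifiers, not real NIH project numbers.
example = ["R01AB000001-01", "R01AB000001-02", "U54XY000002"]
grouped = group_by_base_id(example)
```

With this grouping, the two made-up variants of R01AB000001 fall under a single key instead of looking like separate projects.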
You can test the functionality by running the following from nesta/core/routines/datasets/nih:
luigi --module nih_impute_base_id RootTask