This PR adds two scripts that can be used for collecting grant information from the NIH RePORTER API:
reporter_project_query.py: queries the RePORTER API projects endpoint for grant information and returns the results as a CSV
clean_reporter_results.py: takes the CSV from RePORTER, removes outdated and redundant entries, and returns a CSV
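As a rough illustration of the query step, the request body might be assembled along these lines. This is a minimal sketch, not the code in reporter_project_query.py: the helper name build_payload is hypothetical, the endpoint URL and the criteria and include_fields names are my understanding of the RePORTER v2 API and should be checked against its documentation, and the project number is a placeholder.

```python
import json

# Assumed URL of the RePORTER v2 projects search endpoint; verify before use.
REPORTER_URL = "https://api.reporter.nih.gov/v2/projects/search"


def build_payload(project_nums, include_fields, offset=0, limit=500):
    """Assemble the JSON body for one page of a projects search.

    Hypothetical helper: the key names below are assumptions about the
    RePORTER API payload, not taken from reporter_project_query.py.
    """
    return {
        "criteria": {"project_nums": list(project_nums)},
        "include_fields": list(include_fields),
        "offset": offset,  # index of the first record to return
        "limit": limit,    # number of records per page
    }


# Placeholder project number and an assumed set of field names.
payload = build_payload(
    ["P30XX000000"],
    ["ProjectNum", "SubprojectId", "FiscalYear"],
)
print(json.dumps(payload, indent=2))
```

The payload would then be sent as a POST request to the endpoint, and the returned records written out as CSV rows. Capturing other fields would mostly be a matter of changing the include_fields list.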
Although the scripts were designed specifically to collect subproject_id values, the input arguments and JSON payload in reporter_project_query.py can be modified to capture other information available from the RePORTER API. The extraction and filtering criteria in clean_reporter_results.py can be adjusted in the same way. With those changes, I expect the scripts could be extended to capture most or all of the GrantView and ProjectView metadata we record.
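To show what adjustable filtering criteria could look like, here is a minimal sketch of one possible cleaning rule: keep only the most recent fiscal year for each project and subproject pair. The rule itself, the function name clean_rows, the column names, and the sample rows are all assumptions made for illustration and are not taken from clean_reporter_results.py.

```python
import csv
import io


def clean_rows(rows, key_fields=("core_project_num", "subproject_id"),
               year_field="fiscal_year"):
    """Keep one row per key, preferring the latest fiscal year.

    Hypothetical rule: the key columns and year column are parameters so
    the criteria can be changed without editing the function body.
    """
    latest = {}
    for row in rows:
        key = tuple(row[field] for field in key_fields)
        # Replace the stored row only when this one is from a later year.
        if key not in latest or int(row[year_field]) > int(latest[key][year_field]):
            latest[key] = row
    return list(latest.values())


# Made-up sample data standing in for the CSV returned by the query script.
sample_csv = """core_project_num,subproject_id,fiscal_year
P30XX000000,1234,2021
P30XX000000,1234,2023
P30XX000000,5678,2022
"""

rows = list(csv.DictReader(io.StringIO(sample_csv)))
cleaned = clean_rows(rows)
for row in cleaned:
    print(row["core_project_num"], row["subproject_id"], row["fiscal_year"])
```

Passing different key_fields or year_field values changes what counts as a redundant or outdated entry, which is the kind of adjustment described above.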