Open cameronneylon opened 4 years ago
Yes, this looks doable. Once you have the grant ID, the individual grant pages look like they could be accessed programmatically. The page structure itself looks amenable to parsing with Beautiful Soup and turning back into structured data.
I think this is something that could slot into PoC3, perhaps (see The-Academic-Observatory/observatory-platform#50).
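As a rough illustration of turning a grant page back into structured data, here is a minimal sketch. It uses the standard library's `html.parser` rather than Beautiful Soup, purely to stay dependency-free; Beautiful Soup would likely be more concise. The page layout (label/value pairs in `<dt>`/`<dd>` elements), the function name `parse_grant_page`, and the sample values are all assumptions for illustration and have not been checked against the real ARC pages.

```python
from html.parser import HTMLParser


class DefinitionListParser(HTMLParser):
    """Collect <dt>label</dt><dd>value</dd> pairs into a dict.

    Assumes (hypothetically) that grant details are laid out as a
    definition list; adjust the tag names to the real page structure.
    """

    def __init__(self):
        super().__init__()
        self.fields = {}
        self._tag = None    # "dt" or "dd" while inside one, else None
        self._label = None  # most recent <dt> text awaiting its <dd>
        self._buf = []      # text fragments for the current element

    def handle_starttag(self, tag, attrs):
        if tag in ("dt", "dd"):
            self._tag = tag
            self._buf = []

    def handle_data(self, data):
        if self._tag:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag != self._tag:
            return  # ignore nested tags such as <a> inside <dd>
        text = " ".join("".join(self._buf).split())  # collapse whitespace
        if tag == "dt":
            self._label = text
        elif self._label is not None:
            self.fields[self._label] = text
            self._label = None
        self._tag = None


def parse_grant_page(html: str) -> dict:
    """Return the label/value pairs found in one grant page."""
    parser = DefinitionListParser()
    parser.feed(html)
    parser.close()
    return parser.fields


# Made-up sample markup, not copied from the ARC site.
sample = "<dl><dt>Grant ID</dt><dd> XX0000000 </dd><dt>Scheme</dt><dd><a href='#'>Example\n Scheme</a></dd></dl>"
print(parse_grant_page(sample))
```

The parser only keeps text inside `<dt>` and `<dd>`, so links or other markup nested in a value are flattened to their text.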
Name: ARC Funding Data
Subject Area: Funding
Harvest Type: Paged JSON
Query Type: via API
Snapshot Type:
Snapshot Frequency:
The ARC provides a JSON API for funding since 2001 at https://dataportal.arc.gov.au/NCGP/Web/Grant/Grants. It is a paged API that returns only a summary and the lead investigator's name for each grant rather than the full details, but we could presumably link much of it up and/or scrape the additional data from the web interface. The JSON data does seem to be well structured and fairly straightforward to incorporate.
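A minimal sketch of walking a paged JSON API of this kind might look like the following. The query parameter names (`page[number]`, `page[size]`), the `data` key in the response, and the helper names `fetch_page` and `iter_grants` are assumptions for illustration; they would need checking against the actual ARC responses, and the URL above may turn out to be the web interface rather than the JSON endpoint itself.

```python
import json
from typing import Callable, Iterator
from urllib.parse import urlencode
from urllib.request import urlopen

# URL as given in this issue; whether it serves JSON directly is an assumption.
BASE_URL = "https://dataportal.arc.gov.au/NCGP/Web/Grant/Grants"


def fetch_page(page: int, size: int = 100) -> dict:
    """Fetch one page of results (parameter names are assumed, not verified)."""
    query = urlencode({"page[number]": page, "page[size]": size})
    with urlopen(f"{BASE_URL}?{query}", timeout=30) as resp:
        return json.load(resp)


def iter_grants(
    fetch: Callable[[int], dict] = fetch_page,
    max_pages: int = 10_000,
) -> Iterator[dict]:
    """Yield grant records page by page until an empty page is returned.

    `max_pages` is a safety cap so a misbehaving API cannot loop forever.
    """
    for page in range(1, max_pages + 1):
        payload = fetch(page)
        records = payload.get("data", [])  # assumed response key
        if not records:
            return
        yield from records


# Offline usage example with a fake fetcher, so no network call is needed.
fake_pages = {1: {"data": [{"id": "a"}, {"id": "b"}]}, 2: {"data": [{"id": "c"}]}}
for grant in iter_grants(lambda p: fake_pages.get(p, {"data": []})):
    print(grant["id"])
```

Passing the fetch function in as an argument keeps the paging logic testable without hitting the live service.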
Alternatively, we could ask the ARC for a data dump of this.
By contrast, NHMRC data is made available as xlsx spreadsheets at: https://www.nhmrc.gov.au/funding/data-research/outcomes-funding-rounds