OHDSI / GIS

https://ohdsi.github.io/GIS
Apache License 2.0
8 stars 9 forks source link

Develop method for API data sources #127

Open kzollove opened 1 year ago

kzollove commented 1 year ago

NOTE: This issue should be refactored before work begins. Example datasets/API calls: SDG Indicator database ACS 5 Year (site) USGS Daily Values (site) India government API (site)

kzollove commented 1 year ago

https://api.dhsprogram.com/#/index.html

kzollove commented 1 year ago

Some starting points/ thoughts for defining this method/best practice:

Best practices for granularity:

Spatially - a data source should not be subdivided past the level of an entire country

Temporally - a data source should not be subdivided past the level of an entire year

Examples:

The DHS API has a bunch of other fields on which to filter. I think we need to play around with the country-year level first to see the file size range of these datasets, but I am hopeful we can give a hard stop at the country-year line for data sources. All of those fields that are returned in the dataset are potential variable sources, as I see it.

Another thing moving forward, should we restrict our scope of potential API sources to those that offer CSV output? Just at the beginning while we gather our bearings, maybe.

kzollove commented 1 year ago

@tibbben would love to get your insight on this. @rtmill and I started a conversation with a big group last week, but it ended pretty pessimistically. We know you are already implementing API downloads. We are cautiously hopeful that we can develop a slightly more dynamic (less hardcoded) solution if we try really hard. That said, your work would be a really solid starting point as we are currently shooting in the dark. Maybe a meeting for the three of us in the next week or two?