Datasets returned from list datasets should _not_ be managed by dashboard-api

Problem: If we want the covid-api to be decoupled from its applications, like the covid-dashboard, we should offload the management of static datasets to somewhere outside the API.

Datasets right now could potentially be sourced from:

GitHub
S3
a metadata API (cough:stac:cough)

The main reason to use GitHub is that it can version datasets, which matters because dataset changes can affect the dashboard and other APIs or services. Versioning is also useful when you need to test a change to a dataset before deploying it to "production".

The challenges with GitHub are that any change to a dataset requires redeploying the API, and that, if datasets are managed in this repo, each instance has to fork the API.

At this time, it doesn't seem possible to offload everything to a metadata API because the datasets endpoint does more than just return a list of datasets. It has a specific schema both for the way datasets are listed (e.g. using "_all", "global" and specific site keys) and for the datasets themselves (including information about how to visualize or provide a time series).

One proposed solution is to use a separate GitHub repo for the static datasets used by each specific instance of the API.

The workflow would be as follows:

Create a GitHub repo to version your static datasets.
Create an S3 location to house your static datasets.
Configure the covid-api to read datasets from this S3 location (use caching so that not every call to /datasets requires a call to S3).
Whenever a dataset change is merged to the "main" production branch of your datasets repo, new dataset versions are pushed to S3.
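
To make the caching step concrete, here is a minimal sketch of how the API could read datasets from S3 without hitting S3 on every call to /datasets. It is not the current implementation: the `CachedDatasets` class, the `make_s3_fetcher` helper, the TTL value, and the bucket and key names are all hypothetical, and it assumes the datasets are stored as a single JSON object.

```python
import json
import time
from typing import Any, Callable


class CachedDatasets:
    """Hold parsed dataset metadata in memory and refetch only after a TTL expires."""

    def __init__(
        self,
        fetch: Callable[[], bytes],
        ttl_seconds: float = 300.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch = fetch            # returns the raw JSON bytes (e.g. from S3)
        self._ttl = ttl_seconds
        self._clock = clock            # injectable so the TTL can be tested
        self._value: Any = None
        self._expires_at = float("-inf")  # forces a fetch on first use

    def get(self) -> Any:
        now = self._clock()
        if now >= self._expires_at:
            # Cache is empty or stale: fetch once and remember the result.
            self._value = json.loads(self._fetch())
            self._expires_at = now + self._ttl
        return self._value


def make_s3_fetcher(bucket: str, key: str) -> Callable[[], bytes]:
    """Build a fetcher that reads one object from S3 (assumes boto3 is installed)."""
    import boto3  # imported lazily so the cache itself has no AWS dependency

    s3 = boto3.client("s3")

    def fetch() -> bytes:
        return s3.get_object(Bucket=bucket, Key=key)["Body"].read()

    return fetch
```

The /datasets handler would then call something like `CachedDatasets(make_s3_fetcher("my-datasets-bucket", "datasets.json")).get()` on a shared instance, with the bucket and key being placeholders. A time-based cache means a newly pushed dataset version shows up after at most one TTL, with no redeploy of the API.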

@drewbo @olafveerman @leothomas WDYT ⬆️

NASA-IMPACT / dashboard-api-starter
