Closed josh-chamberlain closed 1 year ago
I can start scoping this out. How is each metric gathered now? Are they regularly published anywhere else? How often do you want them updated on the site?
@milespratt those metrics were gathered once, manually, and added to the deck.
PRs merged → GitHub API? Community Members Introduced → new people in Discord? Datasets Scraped → this one's tough, but super important. Datasets Documented → this one's easy enough to glean from our datasets repo and tables therein, particularly "Agencies" and "Datasets": https://www.dolthub.com/repositories/pdap/datasets
So maybe a better version of this issue would recommend that we could use a proof of concept for getting a simple metric from the Dolt API to our site: https://docs.dolthub.com/dolthub/api
DoltHub is where we'll be making the most progress in the near term in terms of experiments and bounties.
Other ideas for metrics:
datasets
addedagencies
updatedn states
n agencies
(we only have a few examples of data-intake so far)I can start scoping this out. How is each metric gathered now? Are they regularly published anywhere else? How often do you want them updated on the site?
We can get a (rough) dataset count by counting python scripts in the scraper's repo
excluding config
related files of course.
As for update intervals, we can just run it on merge
Community Members Introduced → new people in Discord?
Discord supports widgets for websites. See their blog post on it here
PRs merged → GitHub API?
GitHub also has webhooks for this if it makes this any easier. GitHub webhooks
We're considering using the Notion API or Django here. Notion is configurable without code, and one less app to deploy. Django is more customizable.
GitHub
DoltHub
Dolt SQL API: https://docs.dolthub.com/dolthub/api