vipulnaik / donations

Donations list website (DLW): a repository for keeping track of public donations by some people I (arbitrarily) decide to track
https://donations.vipulnaik.com
Creative Commons Zero v1.0 Universal
20 stars 2 forks source link

Systematically cover grant process #11

Open riceissa opened 6 years ago

riceissa commented 6 years ago
riceissa commented 6 years ago

It seems like for grant stage, one rule that should be followed is "if there is only one grant for a (donee, donor) combination, then it must either be a planning or initial grant". I think the following query checks for this (i.e. it should be empty if the rule is followed):

select grant_stage,url
from donations
where grant_stage is not NULL
    and (donee,1) in (select donee,count(*) from donations
        where donor='Open Philanthropy Project' group by donee)
    and (grant_stage != 'planning grant' and grant_stage != 'initial grant');
riceissa commented 6 years ago

Grant stage stuff is now at https://github.com/riceissa/open-phil-grant-stage because I decided to do it using a script.

riceissa commented 6 years ago

'Thoroughness of grant application or review process' is also now covered in riceissa/open-phil-grant-stage (I guess the repo is now misnamed, oh well).

riceissa commented 6 years ago

Predictions are done. See the following commits:

riceissa commented 6 years ago

For "Some field giving structured or free-form information on the expected use of the grant money" I would like to see some examples. Does this correspond to the "Purpose" field on a grant page? Or some made-up phrase?

If we take https://www.openphilanthropy.org/focus/global-catastrophic-risks/potential-risks-advanced-artificial-intelligence/george-mason-university-research-future-artificial-intelligence-scenarios as an example, some possible values I can think of are:

riceissa commented 6 years ago

By the way, the grant-process branch in this repo is now mostly obsolete because I decided to do the grant stage stuff via the script linked above rather than by hand. It might still be useful as a way to double-check that field, but isn't intended to be pulled into master anymore.

riceissa commented 6 years ago

I'm trying to use some hacky regex tricks to detect the expected use of grant money. Here are the results so far (see the expected_money_use column): https://github.com/riceissa/open-phil-grant-stage/blob/master/data.csv

riceissa commented 6 years ago

Also as you can see in the CSV, grabbing the purpose from the grant page is very easy and clean. The problem with the purpose is that in my view, in most cases it doesn't capture the most useful information about how the money will be used.

riceissa commented 6 years ago

I've been thinking more about the expected use of grant money one. My guess is that the most useful one to include would be an enum that we can then do group by queries on, so that we can say things like "X% of Open Phil grants are for research, Y% are for campaigns, %Z are for prize money, and so on". But to do this well would probably require manually entering the values or at least manually checking the outputs from a script.

My sense is that nobody besides you is even asking these questions, and I'm not sure if anybody else would begin to care if running such queries became cheap. In chat you said (any of) the sample values I provided in the comment above are fine, which leads me to believe that even you haven't thought through this in sufficient detail to know precisely what you want out of this. That makes me feel suspicious about going ahead with a tedious manual task.

For the free-form version, I think the free-form format contains too much information for a human to process by reading the page, and if one were to focus on a small subset of grants, then one could already check this manually pretty easily. So if we just grab the "purpose" field from the grant page that would be easiest, but I'm not sure the value added would be high.