codeforIATI / iati-ideas

💡 Ideas for new codeforIATI projects and blogs
https://ideas.codeforiati.org

[PROJECT IDEA] XM-DAC organisation lists #25

Open stevieflow opened 3 years ago

stevieflow commented 3 years ago

Rationale

Useful for getting consistent org identifiers, which are essential for publication of networked IATI data

Proposal

Publish a list of identifiers minted from the Channel Codes of the OECD DAC CRS list: http://www.oecd.org/development/financing-sustainable-development/development-finance-standards/dacandcrscodelists.htm

Specifically:

stevieflow commented 3 years ago

(Also: in doing this, set up something (@andylolz has something?) to periodically sync ....)

markbrough commented 3 years ago

Maybe this is potentially some kind of extension of this? https://org-id-finder.codeforiati.org/

matmaxgeds commented 3 years ago

Whatever the scraper/generator comes up with, it would be great if org-id could suck it in. I guess we might be able to come up with several sources to scrape for org_IDs. I wondered about scraping a list of all the orgs mentioned in IATI without an org_ID. I also wondered about some tool to link orgs across the different lists where they are the same org; that is on the heuristic de-duplication list of things to do.

stevieflow commented 3 years ago

Thanks @markbrough @matmaxgeds

I'm minded to keep this limited in scope / application to begin with.

We know that the OECD Purpose Code and Agency lists are useful to IATI publishers for describing other organisations successfully. They are the basis for this list, which is actually incorrect: https://codelists.codeforiati.org/OrganisationIdentifier/

The workflow, however (download list; find entity; mint XM-DAC identifier) is long-winded and an unneeded overhead. If we can generate and (auto) maintain a reference list, then it can indeed have all sorts of applications. But for now - I'd just get the list in place....
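The "download list; find entity; mint XM-DAC identifier" workflow could be sketched roughly as below. This is only a minimal illustration, assuming the identifier is the `XM-DAC-` prefix followed by the channel code; the function names (`mint_xm_dac`, `build_reference_list`, `find_by_name`) are hypothetical, and the two sample entries are the channel codes quoted in the XML elsewhere in this thread.

```python
def mint_xm_dac(code):
    """Mint an identifier from a channel code (assumed pattern: XM-DAC-<code>)."""
    code = str(code).strip()
    if not code.isdigit():
        raise ValueError(f"not a numeric channel code: {code!r}")
    return f"XM-DAC-{code}"


def build_reference_list(channel_codes):
    """Turn (code, name) pairs into reference rows with a minted identifier."""
    return [
        {"identifier": mint_xm_dac(code), "code": str(code).strip(), "name": name.strip()}
        for code, name in channel_codes
    ]


def find_by_name(reference, query):
    """Case-insensitive substring search, standing in for the 'find entity' step."""
    q = query.casefold()
    return [row for row in reference if q in row["name"].casefold()]


# Two entries taken from the channel-code XML quoted in this thread.
SAMPLE = [
    ("12004", "Other public entities in recipient country"),
    ("21001", "Association of Geoscientists for International Development "),
]

reference = build_reference_list(SAMPLE)
for row in find_by_name(reference, "geoscientists"):
    print(row["identifier"], row["name"])
```

With a maintained list like `reference`, a publisher would only need the lookup step rather than the whole download-and-mint routine.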

andylolz commented 3 years ago

Just adding a note… Agencies are available as JSON here: https://datahub.io/core/dac-and-crs-code-lists/r/agencies.json

Channel codes are available as JSON here: https://datahub.io/core/dac-and-crs-code-lists/r/channel-codes.json
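For the agencies list, a similar sketch might combine a donor code and an agency code into one identifier. Everything specific here is an assumption: the `XM-DAC-<donor>-<agency>` pattern, the key names (`donor_code`, `agency_code`, `name`), and the sample rows, which are made up and are not taken from the datahub files. The real JSON may use different field names, so the keys are left as parameters.

```python
import json

# Made-up sample rows; the key names are assumptions, not the real schema.
SAMPLE_JSON = """[
  {"donor_code": "12", "agency_code": "1", "name": "Example agency A"},
  {"donor_code": "3", "agency_code": "1", "name": "Example agency B"}
]"""


def mint_agency_identifier(donor_code, agency_code):
    """Assumed pattern: XM-DAC-<donor code>-<agency code>."""
    return f"XM-DAC-{str(donor_code).strip()}-{str(agency_code).strip()}"


def agency_reference(json_text, donor_key="donor_code",
                     agency_key="agency_code", name_key="name"):
    """Parse a JSON array of agency rows and attach a minted identifier to each."""
    rows = json.loads(json_text)
    return [
        {
            "identifier": mint_agency_identifier(row[donor_key], row[agency_key]),
            "name": str(row[name_key]).strip(),
        }
        for row in rows
    ]


for entry in agency_reference(SAMPLE_JSON):
    print(entry["identifier"], entry["name"])
```

Passing the key names as arguments means the function can be pointed at the actual file once its columns have been checked.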

stevieflow commented 3 years ago

The OECD now have an update of the lists in XML: https://www.oecd.org/dac/financing-sustainable-development/development-finance-topics/crs-xml.htm

For the "Channel of delivery", there's (handily) a status field and activation-date:

<codelist-item status="active" activation-date="2015-01-01" mcd="MCD">
  <code>12004</code>
  <name>
    <narrative>Other public entities in recipient country</narrative>
    <narrative xml:lang="fr">Autres entité publique dans le pays bénéficiaire</narrative>
  </name>
  <description>
    <narrative></narrative>
    <narrative xml:lang="fr"></narrative>
  </description>
  <category>12000</category>
  <dac:reference>DCD/DAC/STAT(2015)14/REV1</dac:reference>
</codelist-item>
<codelist-item status="active" activation-date="1998-01-01" mcd="non-MCD">
  <code>21001</code>
  <name>
    <narrative>Association of Geoscientists for International Development </narrative>
    <narrative xml:lang="fr">Association de géoscientifiques pour le développement international</narrative>
  </name>
  <description>
    <narrative></narrative>
    <narrative xml:lang="fr"></narrative>
  </description>
  <category>21000</category>
  <dac:coefficient>100</dac:coefficient>
</codelist-item>
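The status field makes it straightforward to keep only active entries when generating the list. The sketch below uses Python's standard `xml.etree.ElementTree`. Several things are assumptions: the wrapping `<codelist-items>` root element, the `urn:example:dac` namespace URI (a placeholder so that the `dac:` prefix parses), the made-up `withdrawn` entry, and the `XM-DAC-<code>` identifier pattern. The real OECD file may be structured differently.

```python
import xml.etree.ElementTree as ET

# Attribute name ElementTree uses for xml:lang.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# The first item is quoted from this thread; the second is a made-up
# non-active entry. Root element and namespace URI are placeholders.
SAMPLE_XML = """<codelist-items xmlns:dac="urn:example:dac">
  <codelist-item status="active" activation-date="2015-01-01" mcd="MCD">
    <code>12004</code>
    <name>
      <narrative>Other public entities in recipient country</narrative>
      <narrative xml:lang="fr">Autres entité publique dans le pays bénéficiaire</narrative>
    </name>
    <category>12000</category>
    <dac:reference>DCD/DAC/STAT(2015)14/REV1</dac:reference>
  </codelist-item>
  <codelist-item status="withdrawn" activation-date="2000-01-01" mcd="non-MCD">
    <code>99999</code>
    <name><narrative>Hypothetical withdrawn channel</narrative></name>
    <category>99000</category>
  </codelist-item>
</codelist-items>"""


def active_channel_codes(xml_text):
    """Return one row per item whose status attribute is 'active'."""
    root = ET.fromstring(xml_text)
    rows = []
    for item in root.iter("codelist-item"):
        if item.get("status") != "active":
            continue
        code = (item.findtext("code") or "").strip()
        # Take the narrative with no xml:lang attribute as the default name.
        name = ""
        for narrative in item.findall("name/narrative"):
            if narrative.get(XML_LANG) is None:
                name = (narrative.text or "").strip()
                break
        rows.append({
            "identifier": f"XM-DAC-{code}",  # assumed identifier pattern
            "code": code,
            "name": name,
            "activation_date": item.get("activation-date"),
        })
    return rows


for row in active_channel_codes(SAMPLE_XML):
    print(row["identifier"], row["name"], row["activation_date"])
```

Keeping `activation_date` in each row would let a periodic sync job report which identifiers are new since the last run.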
stevieflow commented 3 years ago

@andylolz can this be added to the Projects board, please?

markbrough commented 3 years ago

This would be cool! I also heard mention of Public Bodies again the other day... made me think back to Tim's paper on govt organisation identifiers!

In terms of how to implement this: