alephdata / followthemoney

Data model and processing tools for investigative entity data
https://followthemoney.tech
MIT License
217 stars 53 forks source link

OpenOwnership enricher #172

Closed augusto-herrmann closed 2 years ago

augusto-herrmann commented 4 years ago

A process for enriching a followthemoney entity and its relationships by looking it up in the OpenOwnership Register. From their website:

Freely licensed data, in bulk

We release all of our data as an open ledger, formatted using the Beneficial Ownership Data Standard (BODS).

This data is updated monthly, and made available as a free download under the Open Data Commons Attribution License.

Currently containing nearly 20 million BODS statements, in a 10GB JSONLines file.

Navigating the maze of corporate shell companies that own one another is a big challenge for investigative reporting and money laundering investigations. Beneficial ownership data seems to be especially useful for following the money on a project like aleph.

pudo commented 4 years ago

Hey @augusto-herrmann, long time no see. I'd love to see a PR that could convert OpenOwnership data to Follow The Money entities, it'd be super useful to have as more jurisdictions of OO data go online (so far it's been a pretty limited set, and the OO web site mixes up all the sources which makes much harder to use in practice).

The way this could be implemented could be analogous to the OCDS importer we have. This is simply a command-line option like this:

cat opentender.eu.json | ftm import-ocds | ftm aggregate >opentender.ftm.json

The code for this is quite simple.

pudo commented 2 years ago

This is WIP here now: https://github.com/opensanctions/bods-ftm

pudo commented 2 years ago

I don't think there's actually an open action on this, closing as done.

augusto-herrmann commented 2 years ago

Cool! Good job, @pudo.