alephdata / aleph

Search and browse documents and data; find the people and companies you look for.
http://docs.aleph.occrp.org
MIT License
2.03k stars 272 forks source link

FEATURE: Record and show the history of changes to an entity over time #2359

Open ozhyrenkov opened 2 years ago

ozhyrenkov commented 2 years ago

Is your feature request related to a problem? Please describe. The problem to solve can be briefly described as data dynamics. There are one-time loads like leaks of different kinds, but there are also datasets which can be regularly updated, including, but not limiting to:

In any of the aforementioned, but also in datasets of other kinds the data might change. For the Intervals it works fine, like sequence of successors or owners\directors. But for the changes of attributes there are no clear way to trace the history of changes. Examples of such kind:

Describe the solution you'd like A clear and easy to use way to see and trace the history of changes made to the entity of any kind:

Describe alternatives you've considered

Additional context I think that this supposed to be part not only of Aleph, but to some extend of followthemoney as a data standard. This can bring the whole ecosystem of FTM to a different level of analytics.

pudo commented 2 years ago

FWIW the way I've been doing this in OpenSanctions (on top of followthemoney is by using statement-based claim storage). This works well but a) I'm not sure it would scale to anywhere near where Aleph is in terms of data volume (the current OS db is 7.4mn statements for 220000 entities), and b) it does lead to a weird impedance when we merge it back down into normal FtM.

More info: https://www.opensanctions.org/docs/statements/

Maybe one day there could be qualified values in FtM, that also have metadata (per prop value) regarding source, timestamps, language, and even quality rank like in Wikidata.