IATI / refresher

A Python application which has the responsibility of tracking IATI data from around the Web and refreshing the core IATI software's data stores
GNU Affero General Public License v3.0
2 stars 0 forks source link

If a document has some activites that become not valid, they are removed from data store - can we keep them? #309

Open odscjames opened 10 months ago

odscjames commented 10 months ago

Scenario

Activity file with 6 activities, all of them great. They go in data store.

An update is done, one activity now has errors such that the validator marks it not valid. The clean stage will remove it.

Currently

The solrize stage will remove all old activities and put new ones in, so we'll be left with 5 good activities in the datastore.

Feature?

The solrize stage carefully does not remove the last good version of the activity that is now invalid. The datastore has the latest version of 5 good activites in, and an older version of 1 activity from a time when it was not invalid.

Can of worms

I can see this becoming a can of worms; could get very complex and for data removal policy reasons we need to make sure that if a activity is removed from a file it also is removed from the data store!