IATI / refresher

A Python application which has the responsibility of tracking IATI data from around the Web and refreshing the core IATI software's data stores
GNU Affero General Public License v3.0

solrize stage produces error on datasets that have empty <iati-identifier> elements in them, leaves dataset unprocessed #335

Closed: simon-20 closed this issue 2 months ago

simon-20 commented 2 months ago

Brief Description

The solrize stage of the pipeline crashes on datasets that contain empty <iati-identifier> elements.

(It's unclear whether the other stages of the pipeline handle such datasets correctly.)

Severity

High

Issue Location

This dataset contains multiple IATI activities which have empty <iati-identifier> elements:

https://iatiregistry.org/api/action/package_show?id=ca7347c3-de5f-49dc-b1c7-0b345190f83e

The solrize stage crashes on both the dev and production instances when trying to solrize this dataset.
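To check whether a given dataset is affected, a minimal sketch along the following lines may help. It assumes the dataset XML has already been downloaded into a string. The function name `find_empty_identifier_activities` and the sample document are hypothetical and are not taken from the refresher codebase.

```python
import xml.etree.ElementTree as ET


def find_empty_identifier_activities(xml_text: str) -> list[int]:
    """Return 0-based positions of <iati-activity> elements whose
    <iati-identifier> is missing, empty, or whitespace-only.

    Hypothetical helper for illustration; not part of the refresher.
    """
    root = ET.fromstring(xml_text)
    affected = []
    for index, activity in enumerate(root.iter("iati-activity")):
        identifier = activity.find("iati-identifier")
        # identifier.text is None for <iati-identifier/> and
        # <iati-identifier></iati-identifier>
        if identifier is None or not (identifier.text or "").strip():
            affected.append(index)
    return affected


# Made-up sample data, not the dataset referenced in this issue.
SAMPLE = """<iati-activities version="2.03">
  <iati-activity><iati-identifier>XM-EXAMPLE-1</iati-identifier></iati-activity>
  <iati-activity><iati-identifier></iati-identifier></iati-activity>
  <iati-activity><iati-identifier>   </iati-identifier></iati-activity>
</iati-activities>"""

print(find_empty_identifier_activities(SAMPLE))  # → [1, 2]
```

Whitespace-only identifiers are treated as empty here, on the assumption that they are just as unusable as an activity ID.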

Steps to Reproduce

Expected Results/Behaviour

Assign the activity an IATI identifier before passing it to the solrize stage, so that the pipeline and Solr itself have an activity ID to use.
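One way this could look is sketched below. It is only an illustration under stated assumptions: the placeholder scheme (dataset ID, activity position, and a short content hash) and the function name `ensure_identifier` are hypothetical, and the issue does not say how the refresher actually resolved this.

```python
import hashlib
import xml.etree.ElementTree as ET


def ensure_identifier(activity: ET.Element, dataset_id: str, index: int) -> str:
    """Return the activity's IATI identifier, or a deterministic
    placeholder when the identifier is missing or empty.

    Hypothetical scheme for illustration; not the refresher's method.
    """
    identifier = activity.find("iati-identifier")
    text = (identifier.text or "").strip() if identifier is not None else ""
    if text:
        return text
    # Hash the serialized activity so the placeholder stays stable
    # across runs as long as the activity content is unchanged.
    digest = hashlib.sha1(ET.tostring(activity)).hexdigest()[:12]
    # Include the position so identical empty activities do not collide.
    return f"{dataset_id}-missing-id-{index}-{digest}"


# Usage with made-up data: the first activity has an empty identifier.
root = ET.fromstring(
    "<iati-activities>"
    "<iati-activity><iati-identifier/></iati-activity>"
    "<iati-activity><iati-identifier>XM-EXAMPLE-1</iati-identifier></iati-activity>"
    "</iati-activities>"
)
ids = [
    ensure_identifier(activity, "example-dataset", i)
    for i, activity in enumerate(root.iter("iati-activity"))
]
print(ids)
```

A deterministic placeholder keeps re-runs idempotent, so the same activity would map to the same document ID each time. Whether a placeholder is acceptable at all, rather than skipping the activity and reporting it, is a design decision for the maintainers.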

Actual Results/Behaviour

The solrize stage crashes and the dataset is left unprocessed.