bbcarchdev / spindle

RES Linked Open Data aggregation engine
https://bbcarchdev.github.io/spindle/
Apache License 2.0
2 stars 1 forks source link

Ingested data inaccessible as labels and titles are missing #118

Open townxelliot opened 7 years ago

townxelliot commented 7 years ago

Some resources ingested by RES lose important textual data during ingestion. This means that these resources are effectively invisible, as there's no way to find them with a text search.

Two examples from the live system:

REMARC

For the resource http://remarc.acropolis.org.uk/1930_Childhood_ClassMetalWork, dct:title is specified in the source RDF (which Acropolis has ingested, at least partially). Terry and I checked that the namespace given for dct matches the one in Acropolis (it does).

The resource is in Acropolis: http://acropolis.org.uk/?uri=http://remarc.acropolis.org.uk/1930_Childhood_ClassMetalWork but the ingested version has no labels or titles; which means no one can find it with a search.

BBC TEACH

http://bbcteach.acropolis.org.uk/b03g6vz4 has the same problem: the label isn't indexed, so the data can't be searched.

There is an rdfs:label in the source data, but it's not in Acropolis. You can see what the live index does have at: http://acropolis.org.uk/?uri=http://bbcteach.acropolis.org.uk/b03g6vz4