Closed jennet closed 4 years ago
The dimension http://gss-data.org.uk/def/dimension/flow
includes a code list that is there in error and should not be in the data: http://gss-data.org.uk/def/concept-scheme/migration-directions
I cannot find any reference to it in components.csv
There also appear to be two different dimensions, Flow and Flow Directions, that use the same code list. Not sure if this is intentional.
@ajtucker mentioned this morning that this was likely caused by Jenkins not flushing the components graph when that pipeline was re-run.
We should just be using flow-direction in the trade datasets, see GSS-Cogs/family-trade#62
All trade datasets have been updated to use flow directions rather than flow
The migration-directions code list is still coming through in the extract, attached to the flow dimension.
The extraction scripts retrieve the migration-directions code list and extend the flow dimension to include it as a code list on these datasets:
ons-uk-trade-in-goods-by-industry-country-and-commodity should now be fixed.
ons-international-exports-of-services-from-subnational-areas-of-the-uk now correctly shows 'Flow Directions' in its output. Would you be able to double-check this one?
I think I've found and removed all the stragglers now. We're not using multiple codelists per dimension in the trade datasets currently.
# Find trade datasets where a dimension has more than one code list attached.
PREFIX qb: <http://purl.org/linked-data/cube#>
SELECT * WHERE {
  ?d a qb:DataSet ;
     qb:structure / qb:component / qb:dimension ?dim .
  ?dim qb:codeList ?cl1, ?cl2 .
  FILTER (?cl1 != ?cl2 && regex(str(?d), "trade")) .
}
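For anyone wanting to sanity-check the logic of the query above without a SPARQL endpoint, here is a minimal Python sketch of the same idea over plain (subject, predicate, object) tuples. The helper name `dimensions_with_multiple_codelists`, the sample triples and the `example.org` URIs are made up for illustration; only the `qb:codeList` predicate and the flow / migration-directions URIs come from this thread. Unlike the query, it does not filter on datasets with "trade" in the URI.

```python
from collections import defaultdict

QB_CODE_LIST = "http://purl.org/linked-data/cube#codeList"


def dimensions_with_multiple_codelists(triples):
    """Return {dimension: sorted code lists} for dimensions with more than one code list."""
    codelists = defaultdict(set)
    for subject, predicate, obj in triples:
        if predicate == QB_CODE_LIST:
            codelists[subject].add(obj)
    return {dim: sorted(cls) for dim, cls in codelists.items() if len(cls) > 1}


FLOW = "http://gss-data.org.uk/def/dimension/flow"
MIGRATION = "http://gss-data.org.uk/def/concept-scheme/migration-directions"
# Hypothetical URIs, used only to give the sample some shape.
FLOW_DIRECTIONS_DIM = "http://example.org/dimension/flow-directions"
FLOW_DIRECTIONS_CL = "http://example.org/concept-scheme/flow-directions"

sample = [
    (FLOW, QB_CODE_LIST, FLOW_DIRECTIONS_CL),
    (FLOW, QB_CODE_LIST, MIGRATION),  # the straggler
    (FLOW_DIRECTIONS_DIM, QB_CODE_LIST, FLOW_DIRECTIONS_CL),
]

offenders = dimensions_with_multiple_codelists(sample)
for dim, cls in offenders.items():
    print(dim, "->", ", ".join(cls))
```

An empty result here corresponds to the query returning no rows, i.e. no dimension with more than one code list.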
Some data issues are due to stale data in the PMD3 database: the error has been fixed in the configuration, but the fix has not removed the erroneous triples from the underlying database, so they are still pulled through into PMD4.
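One way to spot this kind of staleness might be to diff what the current configuration declares against what the database actually returns. A rough sketch, assuming both sides have already been reduced to sets of (dimension, code list) pairs; the `stale_pairs` helper, the `example.org` URI and the way the two sets are obtained are assumptions, not part of the existing extraction scripts.

```python
def stale_pairs(configured, stored):
    """Return pairs found in the database that the configuration no longer declares."""
    return sorted(set(stored) - set(configured))


FLOW = "http://gss-data.org.uk/def/dimension/flow"
MIGRATION = "http://gss-data.org.uk/def/concept-scheme/migration-directions"
# Hypothetical code list URI standing in for whatever the configuration declares.
FLOW_DIRECTIONS_CL = "http://example.org/concept-scheme/flow-directions"

configured = {(FLOW, FLOW_DIRECTIONS_CL)}
stored = {(FLOW, FLOW_DIRECTIONS_CL), (FLOW, MIGRATION)}

stale = stale_pairs(configured, stored)
for dim, code_list in stale:
    print("stale:", dim, "->", code_list)
```

Anything this reports would be a candidate for removal from the underlying database rather than a configuration change.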