GSS-Cogs / HO-asylum


Old statistical geographies used #3

Open ajtucker opened 5 years ago

ajtucker commented 5 years ago

The final SPARQL tests on PMD are failing, see https://ci.floop.org.uk/job/GSS_data/job/Migration/job/HO-asylum/107/testReport/(root)/tests_pmd/All_pmd_codesUsed_must_have_a_label__for_PMD_dataset_search_/

Some geographic codes are being used that don't have corresponding labels in PMD. These codes appear to be old ONS codes for local authorities that no longer exist.
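The failing check amounts to a set difference: every geography code used in the dataset must have a label. A minimal Python sketch of that idea follows, assuming the used codes and the known labels have already been fetched. The function name and the sample data are hypothetical and are not taken from the actual test suite.

```python
def codes_missing_labels(used_codes, labels):
    """Return the used codes that have no (non-empty) label, sorted for stable output."""
    return sorted(code for code in set(used_codes) if not labels.get(code))


# Hypothetical sample data: one current-style code with a label,
# and one old-style code (taken from this issue) without a label.
used = ["E06000001", "35UB", "35UB"]
labels = {"E06000001": "Hartlepool"}

missing = codes_missing_labels(used, labels)
print(missing)
```

Any code that ends up in `missing` would cause the "codes used must have a label" test to fail.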

We need to figure out:

LPerryman commented 4 years ago

Updated the file reference-geography.ttl in the gdp-vocabs repository with the 8 digit geography codes that were throwing errors, as described in this document: https://docs.google.com/document/d/1dcYMlKOUOO4L5yCTci3v-pwbTXGLKl1gzrO1aSWvhNE/edit#heading=h.zdum4ljuxv

LPerryman commented 4 years ago

Only the following 4 digit codes and other-and-unknown are throwing an error. These are codes that were assigned to places before they were merged with others:

http://statistics.data.gov.uk/id/statistical-geography/35UB
http://statistics.data.gov.uk/id/statistical-geography/35UC
http://statistics.data.gov.uk/id/statistical-geography/35UD
http://statistics.data.gov.uk/id/statistical-geography/39UB
http://statistics.data.gov.uk/id/statistical-geography/15UB
http://statistics.data.gov.uk/id/statistical-geography/15UC
http://statistics.data.gov.uk/id/statistical-geography/35UE
http://statistics.data.gov.uk/id/statistical-geography/13UB
http://statistics.data.gov.uk/id/statistical-geography/20UB
http://statistics.data.gov.uk/id/statistical-geography/13UC
http://statistics.data.gov.uk/id/statistical-geography/13UD
http://statistics.data.gov.uk/id/statistical-geography/20UD
http://statistics.data.gov.uk/id/statistical-geography/20UE
http://statistics.data.gov.uk/id/statistical-geography/20UF
http://statistics.data.gov.uk/id/statistical-geography/13UE
http://statistics.data.gov.uk/id/statistical-geography/46UB
http://statistics.data.gov.uk/id/statistical-geography/15UD
http://statistics.data.gov.uk/id/statistical-geography/13UG
http://statistics.data.gov.uk/id/statistical-geography/09UC
http://statistics.data.gov.uk/id/statistical-geography/15UE
http://statistics.data.gov.uk/id/statistical-geography/39UC
http://statistics.data.gov.uk/id/statistical-geography/46UC
http://statistics.data.gov.uk/id/statistical-geography/39UD
http://statistics.data.gov.uk/id/statistical-geography/15UF
http://statistics.data.gov.uk/id/statistical-geography/15UG
http://statistics.data.gov.uk/id/statistical-geography/46UD
http://statistics.data.gov.uk/id/statistical-geography/20UG
http://statistics.data.gov.uk/id/statistical-geography/39UE
http://statistics.data.gov.uk/id/statistical-geography/09UE
http://statistics.data.gov.uk/id/statistical-geography/39UF
http://statistics.data.gov.uk/id/statistical-geography/20UH
http://statistics.data.gov.uk/id/statistical-geography/35UF
http://statistics.data.gov.uk/id/statistical-geography/13UH
http://statistics.data.gov.uk/id/statistical-geography/35UG
http://statistics.data.gov.uk/id/statistical-geography/20UJ
http://statistics.data.gov.uk/id/statistical-geography/46UF
http://statistics.data.gov.uk/id/statistical-geography/other-and-unknown

LPerryman commented 4 years ago

Rows with 3/4 digit geographies or 'other-and-unknown' have been removed from the dataset in the script Asylum seekers receiving support(As_16_q).py. The pipeline now runs OK, but I'm leaving this issue open for now.
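For reference, the kind of filter described above could look roughly like the sketch below. This is not the code from the pipeline script. It assumes rows are plain dictionaries, that the geography column is called `Geography`, and that an old code is any value of 3 or 4 digits or upper-case letters; all of these names and the sample rows are hypothetical.

```python
import re

# Assumption: old-style codes are 3 or 4 characters made of digits and
# upper-case letters, e.g. "35UB". Current 9-character codes do not match.
LEGACY_CODE = re.compile(r"[0-9A-Z]{3,4}")


def is_unsupported_geography(code):
    """True for old 3/4 character codes and for the 'other-and-unknown' placeholder."""
    return code == "other-and-unknown" or LEGACY_CODE.fullmatch(code) is not None


def drop_unsupported_rows(rows, column="Geography"):
    """Keep only the rows whose geography code is not an unsupported one."""
    return [row for row in rows if not is_unsupported_geography(row[column])]


# Hypothetical sample rows, for illustration only.
rows = [
    {"Geography": "E06000001", "Value": 10},
    {"Geography": "35UB", "Value": 5},
    {"Geography": "other-and-unknown", "Value": 2},
]

kept = drop_unsupported_rows(rows)
print(kept)
```

Note that dropping rows this way removes the corresponding observations from the published dataset, so it works around the missing labels rather than resolving them.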