Living-with-machines / CensusGeocoder

Geocode Historic Great British Census Data 1851-1911
MIT License

Check permissions/copyright status of desired outputs #11

Closed josh-rhodes closed 1 week ago

josh-rhodes commented 2 years ago

Indicative format of outputs below. @claireaustin01, I will add in more details of the fields in due course.

josh-rhodes commented 2 years ago

OS Open Roads Revised Output

| road_id_01 | geometry | name1 | nameTOID | new_id | conparid_01-11 | CEN_1901 |
| --- | --- | --- | --- | --- | --- | --- |
| osgb4000000006295134_109960.0_5610001 | MULTILINESTRING Z ((378225 600232 0, 378266 600240 0, 378347.64 600244.38 0, 378471 600251 0), (3758 | FOREST DRIVE | osgb4000000006295134 | 109960.0_5610001 | 109960 | 5610001 |
| osgb4000000006295135_109958.0_5610001 | MULTILINESTRING Z ((378191.36 603573.6900000001 0, 378226.19 603658.0600000001 0, 378317 603878 0, 3 | COTTONSHOPE ROAD | osgb4000000006295135 | 109958.0_5610001 | 109958 | 5610001 |

FYI, name1, which holds the street name (e.g. 'FOREST DRIVE'), comes from OS Open Roads, not the census.
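Judging only from the two sample rows above, road_id_01 looks like nameTOID joined to new_id with an underscore, and new_id in turn looks like conparid_01-11 (written as a float) joined to CEN_1901. A minimal Python sketch of that reading follows. The helper name `split_road_id` is hypothetical and is not part of CensusGeocoder, and the layout is an assumption inferred from the samples rather than a documented format.

```python
def split_road_id(road_id: str) -> dict:
    """Split a road_id_01 value into its apparent components.

    Assumed layout (inferred from the sample rows, not documented):
    <nameTOID>_<conparid_01-11 as float>_<CEN_1901>
    """
    toid, conparid, cen = road_id.split("_")
    return {
        "nameTOID": toid,
        "new_id": f"{conparid}_{cen}",
        "conparid_01-11": int(float(conparid)),
        "CEN_1901": int(cen),
    }


# First sample row from the OS Open Roads output above.
parts = split_road_id("osgb4000000006295134_109960.0_5610001")
print(parts)
```

If the layout assumption is wrong for other rows (for example, an identifier containing extra underscores), the three-way unpacking raises a `ValueError` rather than silently returning bad fields.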

josh-rhodes commented 2 years ago

GB1900 Revised Output

| pin_id | final_text | osgb_east | osgb_north | geometry | new_id | conparid_01-11 | CEN_1901 |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 57f3c2a72c66dcab3d0098dd | ALBERT EMBANKMENT | 530514.0869 | 179055.9553 | POINT (530514.0869470719 179055.955260041) | 100003.0_240002 | 100003 | 240002 |
| 57f131f52c66dca3220174dd | VAUXHALL | 530337.2164 | 178135.2528 | POINT (530337.2163969669 178135.252814792) | 100003.0_240002 | 100003 | 240002 |
| 57f132412c66dca322017523 | BROAD STREET | 530536.0609 | 178693.4169 | POINT (530536.060890616 178693.416869754) | 100003.0_240002 | 100003 | 240002 |

josh-rhodes commented 2 years ago

Census Revised Output

| RecID | pin_id | road_id_01 |
| --- | --- | --- |
| 1 | | osgb4000000030483049_100001.0_10101 |
| 2 | | osgb4000000030483049_100001.0_10101 |
| 3 | | osgb4000000030483049_100001.0_10101 |
| 4 | | osgb4000000030483049_100001.0_10101 |

FYI: if these people (RecIDs 1-4) had been linked to GB1900, they would have an entry in 'pin_id'; since they haven't been linked, there is no entry. So some people will have a pin_id, some a road_id_01, and some both, depending on which source they've been linked to.
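To illustrate the point about pin_id and road_id_01, here is a small Python sketch that labels each census record by the source(s) it has been linked to. The function `link_source` is hypothetical, the second and third records are invented for illustration (they reuse IDs from the samples above but are not real linkages), and the "unlinked" label is an assumption about records with neither ID.

```python
def link_source(pin_id, road_id_01):
    """Label a census record by which geocoding source(s) it is linked to."""
    has_pin = bool(pin_id)        # linked to a GB1900 label
    has_road = bool(road_id_01)   # linked to an OS Open Roads segment
    if has_pin and has_road:
        return "both"
    if has_pin:
        return "GB1900"
    if has_road:
        return "OS Open Roads"
    return "unlinked"  # assumed label; this case is not described in the thread


records = [
    # RecID 1 as shown in the sample output: road link only.
    {"RecID": 1, "pin_id": "", "road_id_01": "osgb4000000030483049_100001.0_10101"},
    # Hypothetical records, for illustration only.
    {"RecID": 9001, "pin_id": "57f3c2a72c66dcab3d0098dd", "road_id_01": ""},
    {"RecID": 9002, "pin_id": "57f131f52c66dca3220174dd",
     "road_id_01": "osgb4000000030483049_100001.0_10101"},
]

labels = {r["RecID"]: link_source(r["pin_id"], r["road_id_01"]) for r in records}
print(labels)
```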

claireaustin01 commented 2 years ago

Claire's notes:

- SN7856: comprises names, addresses and a unique identifier (requires special licence permission via UKDS).
- SN7481: comprises other generic census information, e.g. parish, district, employment (available to UKDS users without special licence permission).

Both studies need to be combined in order to access the full census information.

We have linked the unique identifier from SN7856 with GB1900 and OS Open Roads to create 'Josh's census dataset'.

Note: the unique ID has been released by other studies without a requirement for permission to reuse (e.g. SN8600).

- OS Open Roads: Open Government Licence terms, https://www.nationalarchives.gov.uk/doc/open-government-licence/version/2/
- GB1900, via the Vision of Britain website: CC-BY-SA

We want to release on:

- UK Data Service
- BL Repository (if possible)

We will release the code separately, but the code alone will not make it possible to reconstruct names and addresses without the SN7856 special licence permission.

claireaustin01 commented 2 years ago

@josh-rhodes In the GB1900 Revised Output, does the new_id column refer to a street ID?