mysociety / whatdotheyknow-theme

The Alaveteli theme for WhatDoTheyKnow (UK)
http://www.whatdotheyknow.com/
MIT License

More comprehensive tagging of authorities with `dpr:nnnnnnnnn` #1291

Open MattK1234 opened 2 years ago

MattK1234 commented 2 years ago

We currently have 1113 authorities tagged with a DPR number, linked to the ICO's register of fee payers.

When tagged this produces a nice link for 'Data Protection Registration' in the right sidebar of the body page.

I've just tagged up a few hundred more via a CSV upload.
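For anyone preparing a similar upload, here is a minimal sketch of how the tag column could be built so that a body is never given the same `dpr:` tag twice. It assumes tag strings are space-separated `name:value` pairs; the function names and the registration number in the usage note are made up for illustration and are not part of Alaveteli.

```python
def dpr_values(tag_string: str) -> list[str]:
    """Return the values of every dpr: tag in a space-separated tag string."""
    values = []
    for tag in tag_string.split():
        name, sep, value = tag.partition(":")
        if name == "dpr" and sep and value:
            values.append(value)
    return values


def with_dpr_tag(tag_string: str, registration_number: str) -> str:
    """Append dpr:<number> unless the tag string already carries that exact tag.

    Hypothetical helper: the real CSV import may treat tags differently.
    """
    number = registration_number.strip()
    if not number:
        raise ValueError("empty registration number")
    tags = tag_string.split()
    if number not in dpr_values(tag_string):
        tags.append(f"dpr:{number}")
    return " ".join(tags)
```

Calling `with_dpr_tag("school urn:123", "Z0000000")` gives a tag string ending in `dpr:Z0000000`, and calling it again on the result leaves the string unchanged.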

mdeuk commented 2 years ago

> I've just tagged up a few hundred more via a CSV upload.

Thank you!

Q: Did you source the data manually (e.g. by querying the ICO DB), or did you use the (rather bulky) extract to do it?

MattK1234 commented 2 years ago

> Q: Did you source the data manually (e.g. by querying the ICO DB), or did you use the (rather bulky) extract to do it?

I did that lot manually but for my next load I'm going to have a play around with the extract 😄

WilliamWDTK commented 2 years ago

I had a look at the extract before (December 2021). There is a column for public authorities in there, which is supposed to identify whether they are subject to the FOIA, but it's self-reported (as far as I'm aware) and sometimes inaccurate.

I generated a JSON file with all records tagged as public authorities; it was 60MB. Unfortunately, I can't find the script I used to do so. It might have been one of the occasions where I used the terminal, which wasn't a particularly wise move.
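In case it helps anyone repeat this, a rough sketch of the filtering step follows. It is not the lost script. It assumes the extract is a CSV with a header row, and the column name `Public_Authority` and the accepted "yes" values are guesses that would need checking against the real file. It streams rows and writes one JSON object per line, so the whole extract never has to sit in memory.

```python
import csv
import json
from typing import Iterable, Iterator, TextIO

# Hypothetical column name and flag values: check the real extract's header.
FLAG_COLUMN = "Public_Authority"
TRUTHY = {"y", "yes", "true", "1"}


def public_authority_rows(source: TextIO, flag_column: str = FLAG_COLUMN) -> Iterator[dict]:
    """Yield only the rows whose self-reported public authority flag is set."""
    for row in csv.DictReader(source):
        if (row.get(flag_column) or "").strip().lower() in TRUTHY:
            yield row


def write_json_lines(rows: Iterable[dict], sink: TextIO) -> int:
    """Write each row as one JSON object per line and return the row count."""
    count = 0
    for row in rows:
        sink.write(json.dumps(row, ensure_ascii=False) + "\n")
        count += 1
    return count
```

With real files this would be called with both opened in text mode, the input with `newline=""` as the `csv` module recommends. Because the flag is self-reported, the output is a list of candidates to review rather than an authoritative list.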

RichardTaylor commented 1 year ago

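Might be worth clarifying why we are doing this:

  • Does it help us keep the body database up-to-date?

  • Are we interested in bodies declaring they're subject to FOI in their DPR register entries?

  • Does it help Search Optimisation of our pages?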

mdeuk commented 1 year ago

> Might be worth clarifying why we are doing this:
>
> • Does it help us keep the body database up-to-date?

I’d say it does. It’s also similar to our other database linkages: there is a benefit to our users in providing links to open data.

> • Are we interested in bodies declaring they're subject to FOI in their DPR register entries?

Truthfully, I'd be more interested if it was more than a tickbox on a form. There’s a good bit of inconsistency in this data, which is a shame, as it would be very useful to have an authoritative source from the ICO to refer to. Perhaps this is a useful lobbying point?

> • Does it help Search Optimisation of our pages?

I think the only benefit would be if users were searching by the ICO data protection fee payer number, and we don’t currently present it in a visible format. If we thought it would be beneficial to display the number itself, we probably could; it’s not very long.

MattK1234 commented 1 year ago

I'd agree it's similar to how/why we tag schools with urn: etc.

It provides a method for someone to click through to the data protection register entry from the WDTK body page. I doubt it's used much.

In doing this I found a few different things:

Separately, it gave me a chance to play with the CSV import a bit more, as I haven't used it in a while.

I'm not suggesting we go through and tag every body, but it's maybe something we can be mindful of when we create new bodies or update existing ones.