mysociety / parlparse

The scraper/parser that produces data for TheyWorkForYou, PublicWhip, etc
Other
61 stars 22 forks source link

New Scottish Parliament Scraper #172

Closed ajparsons closed 4 months ago

ajparsons commented 4 months ago

This adds a new scraper for the Scottish Parliament's new site.

I've made a new sp_2024 folder and pulled across some of the elements needed for the ID parser.

There are three main steps:

(common and resolvenames are lightly re-formatted versions of the modules from the old scraper).

This seems to work as I'd expect for some recent ones - haven't tested actually loading the data.

There's some more special case stuff that could be loaded from the old scraper, but probably makes more sense to bring it across as things break?

ajparsons commented 4 months ago

That should be all the major issues tidied up - does this also need an adjustment in TWFY to pull from the sp_2024 directory it puts the finished files in?

dracos commented 4 months ago

Yep, once the parser is updated and has pulled in some data, we can update it so it starts loading in from there