carpentries-incubator / lc-webscraping

Introduction to web scraping
https://carpentries-incubator.github.io/lc-webscraping/
Other
37 stars 28 forks source link

Legislative Assembly of Ontario Page Structure changes #24

Open RichardPBerry opened 6 years ago

RichardPBerry commented 6 years ago

Using this page as the closest approximate for http://www.ontla.on.ca/web/members/members_current.do, it seems that the page structure has changed significantly from the version shown in episode 4 (starting here).

One possibility may be to use a selector like this //tbody/tr//a/@href, but unfortunately that won't show the contains function well. Another option might be to show selecting every second row using a selector like this? //*[contains(concat(" ", normalize-space(@class), " "), " even ")]