biglocalnews / civic-scraper

Tools for downloading agendas, minutes and other documents produced by local government
https://civic-scraper.readthedocs.io

The span does not appear to be within the h2 tag and the extract removes the h2 tag from the tree #188

Closed taz77 closed 2 months ago

taz77 commented 3 months ago

This addresses the issues in #91 and #176.

I tested this change in a debugger, and the committee names are now returned as they were before. I do not know the root cause of the breakage, or why the old code worked previously. I spot-checked several municipalities' websites from the Google Doc, and on all of them the span is not inside the h2 tag.
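To make the behavior described above concrete, here is a minimal sketch using BeautifulSoup. It is not the code from this PR: the HTML snippet, the `listing` class, and both helper functions are hypothetical and only illustrate the two points in the title. A lookup that expects the span inside the h2 finds nothing when the span is a sibling, and `extract()` detaches the h2 from the tree, so sibling lookups from the detached tag also fail.

```python
from bs4 import BeautifulSoup

# Hypothetical markup: the span follows the h2 instead of being nested in it.
HTML = """
<div class="listing">
  <h2>Agendas</h2>
  <span>Planning Commission</span>
</div>
"""


def committee_name_nested(listing):
    """Look for the span only inside the h2 (fails on the markup above)."""
    h2 = listing.find("h2")
    span = h2.find("span") if h2 is not None else None
    return span.get_text(strip=True) if span is not None else None


def committee_name_sibling(listing):
    """Accept a span nested in the h2 or placed right after it as a sibling."""
    h2 = listing.find("h2")
    if h2 is None:
        return None
    span = h2.find("span") or h2.find_next_sibling("span")
    return span.get_text(strip=True) if span is not None else None


soup = BeautifulSoup(HTML, "html.parser")
listing = soup.find("div", class_="listing")

nested_result = committee_name_nested(listing)
sibling_result = committee_name_sibling(listing)

# extract() removes the tag from the tree and clears its sibling links,
# so any lookup that starts from the detached h2 can no longer reach the span.
h2 = listing.find("h2")
h2.extract()
h2_still_in_tree = listing.find("h2")
span_from_detached_h2 = h2.find_next_sibling("span")

print(nested_result, sibling_result, h2_still_in_tree, span_from_detached_h2)
```

If the real parser needs the h2 removed, one option under these assumptions is to read the sibling span before calling `extract()`.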

zstumgoren commented 2 months ago

Hey @taz77, heads up that an updated version of civic-scraper (0.2.10) is now on PyPI. I had to create a new PR (#190) to handle this and an unrelated issue, along with misc headaches related to our build process. Closing this now in favor of #190, but shout back on #176 if you're still finding bugs. Thanks!