codeforsanjose / city-agenda-scraper

9 stars 16 forks source link

Improve Granicus_check function #12

Open krammy19 opened 3 years ago

krammy19 commented 3 years ago

The Granicus_check function in the html-request scraper2 sometimes grabs multiple urls in a list.

For example in this output, see the city of La Puente (line 211) or Sacramento (line 395).

This is because Granicus sometimes has separate agenda pages for different committees / commissions. We ideally want a way to identify which page is the City Council agenda and keep only that url.

In a related issue, San Diego repeats the same url twice.

xconnieex commented 3 years ago

The link you're referencing just leads back to the issues page.

krammy19 commented 3 years ago

Corrected, thanks!