lavaturtle closed this 7 years ago
I ran it locally and it worked fine:-
21:27:17 INFO scrapelib: GET - http://assembly.state.ny.us/mem/Chad-A-Lupinacci
21:27:24 INFO billy: Save person Lupinacci, Chad
21:27:24 INFO scrapelib: GET - http://assembly.state.ny.us/mem/William-Magee
21:27:26 INFO billy: Save person Magee, William
21:27:26 INFO scrapelib: GET - http://assembly.state.ny.us/mem/William-B-Magnarelli
21:27:27 INFO billy: Save person Magnarelli, William
21:27:27 INFO scrapelib: GET - http://assembly.state.ny.us/mem/Nicole-Malliotakis
Finally:-
21:29:26 INFO billy: Finished importing 211 legislator files.
This is strange! It looks like data.xpath('following-sibling::div[1]/text()') (on line 432 of legislators.py) is returning an empty list for that one address, when it seems like it should be returning ['Oneonta Phone', '\n607-432-1484'] (and presumably is returning that for everyone else).
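For illustration, here is a minimal sketch of how a caller could guard against the empty-list case described above. The helper name parse_office_text is hypothetical and is not part of legislators.py; it only assumes the xpath call returns a list of strings shaped like the example above.

```python
def parse_office_text(texts):
    # `texts` is assumed to be the list returned by an xpath text() query,
    # for example ['Oneonta Phone', '\n607-432-1484'].
    # Drop whitespace-only nodes and trim the rest.
    cleaned = [t.strip() for t in texts if t.strip()]
    if not cleaned:
        # Empty result (e.g. the sibling div is missing from the page):
        # return None instead of raising IndexError on cleaned[0].
        return None
    label = cleaned[0]
    phone = cleaned[1] if len(cleaned) > 1 else None
    return label, phone
```

A None return lets the scraper log the offending URL and continue rather than crash on a single malformed page.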
I suspect this has to be something weird about my environment, although I'm not sure what. I'm running Docker version 17.03.1-ce, build c6d412e and docker-compose version 1.12.0-rc2, build 08dc2a4 on Ubuntu 16.10 Yakkety.
If you run without --fast, does this still happen? This may be a bad copy of a page cached locally. You can also wipe out the cache/_cache dirs.
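A rough sketch of wiping the caches from Python, in case doing it by hand is awkward inside a container. It assumes "cache/_cache" means two directories named cache and _cache under the directory the scraper runs from; the function name clear_scrape_caches is made up for this example, so adjust the paths to your checkout.

```python
import shutil
from pathlib import Path


def clear_scrape_caches(base_dir=".", names=("cache", "_cache")):
    """Delete the named cache directories under base_dir.

    Returns the list of paths that were actually removed.
    The directory names are an assumption taken from the comment above.
    """
    removed = []
    for name in names:
        path = Path(base_dir) / name
        if path.is_dir():
            shutil.rmtree(path)  # removes the directory and everything in it
            removed.append(path)
    return removed
```

Calling clear_scrape_caches() from the scraper's working directory and then rerunning forces every page to be fetched again.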
Without --fast it works! Thanks!
State: NY
When I run the New York scraper, it errors out -- looks like it's hitting an error trying to read addresses from http://assembly.state.ny.us/mem/William-Magee