Closed. harrisj closed this issue 9 years ago.
Good tip, thanks. We have previously fought with BeautifulSoup's parser autodetection (see 6e34957c9e77ccea7f2ee742817bcbbc8fffdab8). The upshot is that we should be using "lxml", which will get installed if you run pip install -r requirements.txt. We should explicitly pass "lxml" to the BeautifulSoup constructor wherever we use it; if you could whip up a PR for that, it would be much appreciated!
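For anyone following along, passing the parser explicitly might look roughly like this. It is only a sketch: the HTML string and variable names are made up for illustration, and it assumes bs4 and lxml are installed.

```python
from bs4 import BeautifulSoup

# Made-up markup standing in for a downloaded page (illustration only).
html = "<html><body><p class='report'>Audit report</p></body></html>"

# Name the parser explicitly rather than letting BeautifulSoup pick
# whichever one happens to be installed on this machine.
doc = BeautifulSoup(html, "lxml")

title = doc.find("p", class_="report").get_text()
```

With the parser named, the same markup should parse the same way on every machine that has lxml installed, which is the point of avoiding autodetection.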
And I was just about to submit my own pull request where I moved beautifulsoup_from_url into utils and replaced a bunch of calls in the scrapers with a reference to that... Do you still want it?
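A shared helper along those lines could be sketched as below. This is a guess at the shape, not the code from the pull request: the `fetch` parameter and the urllib-based default are assumptions added so the example is self-contained, and the project's real helper may download pages differently.

```python
from urllib.request import urlopen

from bs4 import BeautifulSoup


def beautifulsoup_from_url(url, fetch=None):
    """Download `url` and parse the body with the lxml parser.

    `fetch` is a hypothetical hook (not taken from the project) that
    takes a URL and returns the page body; it defaults to urllib.
    """
    if fetch is None:
        def fetch(target):
            with urlopen(target) as response:
                return response.read()

    body = fetch(url)
    # The parser is named in one place, so scrapers calling this helper
    # never fall back to autodetection.
    return BeautifulSoup(body, "lxml")
```

Scrapers would then call beautifulsoup_from_url(url) instead of constructing BeautifulSoup themselves, so the parser choice lives in a single spot.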
On Oct 12, 2015, at 6:55 AM, David Cook notifications@github.com wrote:
Closed #251 https://github.com/unitedstates/inspectors-general/issues/251 via 5a102c0 https://github.com/unitedstates/inspectors-general/commit/5a102c04b8916ed9e83eeb4ea0147f6154b6c556 .
That sounds better, I'll revert my commit.
Fixed by #252.
Should I go ahead and do a pull request to fix this? It's not breaking things, it's just mildly annoying.