Open sentry-io[bot] opened 8 months ago
@flooie, flagging for you.
@mlissner CAPTCHA
Fun. Well, let's disable this scraper then, so the error stops and we're not getting bad data. Once we get our new scraper contractor, we can figure out a captcha solving service.
I took another look at this -
We can resolve this if we simply change the user agent. A normal user agent is required to avoid a block for scraping the metadata and it is required for scraping the actual PDF. If we adjust the user agent during the collection of the binary content
This should be resolved with the update to the scraper.
Another instance of this, I think:
Sentry issue: COURTLISTENER-5P4
Yes I agree. I was wondering if it was something in a queue
I doubt it. it was fixed days ago, right?
I'd have to check when it was finished pushing. It's again a captcha thing. I'm going to follow up with the court. I reached out last week and the woman was ... surprised because she never triggers it
Sentry issue: COURTLISTENER-5P9
Sentry issue: COURTLISTENER-5P7
Another mess of issues in Sentry about this one today.
Great. I'll verify which court and disable them. I have an outstanding email to the Minnesota courts about this issue I'll follow up with.
I suspect this is related to the new version of juriscraper. Perhaps one of the new scrapers isn't quite working?
Sentry Issue: COURTLISTENER-5EN