balzer82 / immoscraper

Immoscout24.de scraper and data analytics
48 stars 24 forks source link

HTTPError #6

Open Freelix123 opened 4 years ago

Freelix123 commented 4 years ago

Since a couple of weeks I get the following error, without having changed anything...

Traceback (most recent call last):

File "", line 1, in page_soup = BeautifulSoup(urllib.request.urlopen("https://www.immobilienscout24.de/Suche/de/berlin/berlin/wohnung-mieten").read(),"lxml")

File "C:\Users\Felix\Anaconda3\lib\urllib\request.py", line 222, in urlopen return opener.open(url, data, timeout)

File "C:\Users\Felix\Anaconda3\lib\urllib\request.py", line 531, in open response = meth(req, response)

File "C:\Users\Felix\Anaconda3\lib\urllib\request.py", line 641, in http_response 'http', request, response, code, msg, hdrs)

File "C:\Users\Felix\Anaconda3\lib\urllib\request.py", line 569, in error return self._call_chain(*args)

File "C:\Users\Felix\Anaconda3\lib\urllib\request.py", line 503, in _call_chain result = func(*args)

File "C:\Users\Felix\Anaconda3\lib\urllib\request.py", line 649, in http_error_default raise HTTPError(req.full_url, code, msg, hdrs, fp)

HTTPError

Is this representing a loading error? The URL is correct and works if implemented manually. Does this mean that Immoscout has implemented means to protect itself against web scrapers?

I have also tried ths script with implementing the changing browser profiles which aims to act like more natural querries. This has also not helped.

Thanks for your help!