calgo-lab / green-db

The monorepo that powers the GreenDB.
https://calgo-lab.github.io/green-db/
22 stars 2 forks source link

otto sustainability filter not working #151

Closed BigDatalex closed 1 year ago

BigDatalex commented 1 year ago

The #150 causes a new error, probably due to some encoding of the commas in the URL. The logfile gives the following error:

2023-09-12 16:45:25 [scrapy.core.engine] ERROR: Error while obtaining start requests
Traceback (most recent call last):
  File "/usr/local/lib/python3.9/dist-packages/scrapy/core/engine.py", line 150, in _next_request
    request = next(self.slot.start_requests)
  File "/usr/local/lib/python3.9/dist-packages/scrapy_splash/middleware.py", line 173, in process_start_requests
    for req in start_requests:
  File "/tmp/scraping-1694529892-eug1x4r8.egg/scraping/spiders/_base.py", line 212, in start_requests
    yield self.StartRequest(
  File "/usr/local/lib/python3.9/dist-packages/scrapy_splash/request.py", line 74, in __init__
    super(SplashRequest, self).__init__(url, callback, method, meta=meta,
  File "/usr/local/lib/python3.9/dist-packages/scrapy/http/request/__init__.py", line 60, in __init__
    self._set_url(url)
  File "/usr/local/lib/python3.9/dist-packages/scrapy/http/request/__init__.py", line 108, in _set_url
    raise ValueError(f'Missing scheme in request url: {self._url}')
ValueError: Missing scheme in request url: energieeffiziente-nutzung

Would be great if you could have a look at this @en-GB and please test your solution by running the spider.