Closed BigDatalex closed 1 year ago
Avoid scraping similar products on otto
Set original_URL in base spider on first request, as it is used later in the zalando spider: https://github.com/calgo-lab/green-db/blob/0c14157e749f0aea11c0e082e503b940d48b8570/scraping/scraping/spiders/zalando_de.py#L25-L28
refactor some zalando URL mappings for PANTS/SHORTS and JACKET/SWEATER
Avoid scraping similar products on otto
Set original_URL in base spider on first request, as it is used later in the zalando spider: https://github.com/calgo-lab/green-db/blob/0c14157e749f0aea11c0e082e503b940d48b8570/scraping/scraping/spiders/zalando_de.py#L25-L28
refactor some zalando URL mappings for PANTS/SHORTS and JACKET/SWEATER