-
The title says it all
[pip_install.log](https://github.com/datamade/usaddress/files/12848454/pip_install.log)
-
Currently ItemProvider always requires Response. So, if a page object doesn't need a response, and an item produced by this page object is used as a dependency, then Response is still downloaded.
kmike updated
10 months ago
-
Eg OGP, Twitter cards, etc, as alternatives to mf2 h-card. Ideally I'd find a single lib that supports and normalizes all of them:
* HTML meta
* Twitter cards
* OGP
* Dublin Core?
* embedded JSON…
-
I am using hcf-backend and scrapy frontera to store urls to frontier. I am trying to set priority in spider requests, with priority=10 for some url's. But frontier does not seem to obey this rule, rat…
-
`pip install python-crfsuite` aborts with an error.
Platform details:
* Python 3.12.0
* Windows 10 Enterprise, Version 22H2, OS build 19045.3448
* Processor: 12th Gen Intel(R) Core(TM) i7-12700 …
zsfc updated
10 months ago
-
Good day. It seems requirements define `docker-py` dependency which is as far as I can tell appears to be an outdated version of official docker client from https://github.com/docker/docker-py/. Seems…
-
This is a downstream issue.
## Description
It appears that `python-crfsuite` is not building under python 3.10. Discovered when trying to run 3.10 in `newspaper3k`. See [issue here](https://gi…
-
Shall we update Scrapy to 2.10.1 to prevent Twisted issue https://github.com/scrapy/scrapy/pull/6027 ?
https://github.com/scrapinghub/scrapinghub-stack-scrapy/blob/a4aa5e3ab77ce0f8d674ba52c30e0693…
-
Hi,
Because i want to use http proxy. I want to set `page.authenticate()`
Thanks!
-
i am trying to scrape data from web app , but scrappy is not working. when i opened website in scrappy shell and i used "**view(response)**" command , but website is not loading contents.