-
I'm trying to run a local instance of Portia. I tried to follow the "vagrant up" method, but I get the following error:
> The SSH command responded with a non-zero exit status. Vagrant
> assumes that …
-
See https://github.com/scrapinghub/scrapylib/pull/72#issuecomment-225969154
-
We need a set of REST services to be able to pass crawl requests into the Kafka API to be processed by the Kafka Monitor. Ideally this uses something small like Flask and will run on a server that has…
-
Hi!
It would be great if you could add a filter that prevents the RSS feed from including certain HTML items.
It could be a field that takes a CSS class, to prevent a specific HTML element from
being incorporate…
-
A ticket to discuss and reference Spider development in other languages ([GSoC idea](http://gsoc2015.scrapinghub.com/ideas/#other-languages)).
The goal is to allow creating spiders in any language, t…
-
## Link to Container Collection Log, Build, or Collection (in that order)
https://github.com/CAIsr/singularity-minc-v2-develop
https://singularity-hub.org/collections/352/
### Behavior when Bui…
-
I would like to open up a discussion for Scrapy Cluster as to how it can be easier to work with.
As of right now, SC 1.1 (almost ready) allows you to do local development on a single Virtual Machine…
-
UPDATE 2: I believe this IS a bug although I'm not sure if it's in scrapy or deltafetch.
I have resolved the issue, or at least implemented a workaround, but IMHO this is a bug.
My spider looks like this:…
-
What do you think about renaming this project to `scrapy-splash` to follow the standard naming convention of [scrapy plugins](https://github.com/scrapy-plugins)?
In the past, we had another `scrapy-s…
-
When I try to import SplashRequest, an error occurs:
from scrapyjs import SplashRequest
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ImportError: cannot import name SplashRequest
but
import …