-
Sería interesante poder exponer la BBDD de cada instancia de columnistos a través de una API.
Con ello se podrían hacer visualizaciones y crear plugines y cosas por el estilo.
-
I fail to see how I should "Eggifying" a project that uses DjangoItem.
Am i supposed to bundle the django project too within the egg ?
-
This is a big one, but it's possible that most of this crawler should be replaced with Apache Nutch or similar. I originally hacked this out as a proof-of-concept but as usual, it grew a bit from the…
-
https://github.com/pytest-dev/pytest-twisted/pull/94/checks?check_run_id=692810777
```
============================= test session starts ==============================
platform linux -- Python 3.6.…
-
-
In single node scrapy project, the settings like below as your document indicate works well.
```
# ====== Splash settings ======
SPLASH_URL = 'http://localhost:8050'
DOWNLOADER_MIDDLEWARES = {
…
-
**Goal**
Which URL(s) are reading a specific cookie.
**What I want to see**
A cookie is set by a URL -> list every URL that read this cookie
**Question**
Somehow display that on the hos…
-
We could support plugins for pre- and/or post-processing the document analysis functionality.
A plugin could be a subclass of a class like this:
```python
class AnnifPlugin:
"""A plugin th…
-
I have a Spider that crawls multiple URLs (100+) and I plan using both `Requests` and `SplashRequests` as the website needs JavaScript rendering. And, I am thinking of running [Spiders simultaneously]…
-
While working on documentation for https://docs.zyte.com/ about setting request metadata, I am starting to think that maybe we should not send `echoData` to the server, and instead keep track of it on…