-
Can we get the ubuntu packages updated w/ the 1.0.3 release?
The latest one I see is 1.0.1.post1+g5b8c9e5-1435726464.
-
Say that I want to increase the default threshold for `SPIDERMON_UNWANTED_HTTP_CODES`.
Currently, I need to copy/paste
```
'SPIDERMON_UNWANTED_HTTP_CODES': {
code: 100 for code in [400, 407,…
-
The Cook County Clerk site uses JavaScript to load the details for events, but unfortunately, it uses ASP and therefore the AJAX requests are not as easy to fake as they would be in other web apps.
…
-
It would be great to use `reports/email/monitors/result.jinja` as the default template for `SPIDERMON_BODY_HTML_TEMPLATE`.
-
short plan:
- motivation: what & when to crawl
- the tutorial problem have to include:
- revisiting,
- state management (one-time visit, combination with revisiting),
- creation of request with…
-
I have a spider that runs differently depending on a parameter and currently all the tests are mixed together. Would it be possible to separate tests into different folders by spider arguments (`-a`)…
-
Filename - `tests/test_validators_jsonschema.py`
Line Number - 505
Command - `pytest tests/test_validators_jsonschema.py`
**Testcase in question**
```
DataTest(
name="datetime.…
-
More data to follow
Because `value_counts` is slow, any big df makes report_all awfully slow.
1. See if it can be improved
2. If not, exlude get_categories from report_all
or make a parameter li…
-
This package is great. Thanks for it and other packages from scrapinghub.
Image captions and credits are included in article body. It is messing up with article content.
Example URL
https://www…
-
Coming from here https://github.com/scrapinghub/arche/issues/83
I would like to treat missing values consistent, but I would also love to keep json schemas work and keep `spidermon` and `arche` com…