-
`date_of_birth` and `date_of_death` are DateFields and therefore need ISO-Dates: [abc_E21_Person](https://github.com/acdh-oeaw/apis-core-rdf/blob/c4fe0e3949cb1d062fa541b3b01c5f28848537f5/apis_core/api…
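As a minimal sketch of what an ISO date looks like in practice, the snippet below checks a string with Python's standard library before it is handed to a date field. The helper name `parse_iso_date` and the sample values are hypothetical and not taken from apis-core-rdf.

```python
from datetime import date


def parse_iso_date(value: str) -> date:
    """Parse a YYYY-MM-DD string; raises ValueError for other formats."""
    return date.fromisoformat(value)


# A well-formed ISO date parses cleanly.
birth = parse_iso_date("1850-03-07")

# A localized format such as DD.MM.YYYY is rejected.
try:
    parse_iso_date("07.03.1850")
    rejected = False
except ValueError:
    rejected = True
```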
-
My Lua script has the following snippet:
```
local value = 10.0
splash:on_request(function (request)
request:set_header("Custom-Header", value)
request:set_header("Another-Custom-Head…
-
git clone https://github.com/scrapinghub/mdr.git .
ls
nano requirements.txt
pip install -r requirements.txt
sudo apt-get install python-numpy
sudo apt-get install c…
-
Hi all, I am crawling an Angular website and running Splash with **splash.plugins_enabled = true, splash.html5_media_enabled = true** enabled, and Splash crashes with the error below.
**Current thread 0x00007f0a77…
-
As an avid Scrapy user, I find it really useful to be able to tell Scrapy to look only at pages of a newspaper that match today's date (or this month's). This is done for two reasons:
- New…
-
I've found a case where I need to specify the "accept-encoding" header in order to correctly access the content I'm attempting to scrape (without the header the site is presenting a bot detection capt…
-
Hi,
I’ve been following the docs on how to use Adblock Plus to speed up the rendering of the pages I’m hitting. I’m using it inside Docker (Docker Toolbox 1.11.1b) on Windows 10.
I’m unsure of what …
-
After upgrading to Python 3 with urllib, there is a problem parsing the last-modification date sent by the web server, for example:
Wed, 21 Jun 2017 11:35:20 +0000
The dateutil parser now in use seems unable to parse i…
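For comparison, here is a minimal sketch that parses the same header value with Python's standard library instead of dateutil. It is offered as one possible alternative under that assumption, not as the project's actual fix; the variable names are illustrative.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime

# The Last-Modified style value quoted in the report above.
raw = "Wed, 21 Jun 2017 11:35:20 +0000"

# parsedate_to_datetime understands RFC 2822 style dates with a numeric offset
# and returns a timezone-aware datetime for "+0000".
parsed = parsedate_to_datetime(raw)

# Convert to a POSIX timestamp if a numeric value is needed for comparisons.
timestamp = parsed.timestamp()
```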
-
There is no apparent way to get a list of accepted regions for use with the region argument for parse.
If there is one, it should be in the docs.
Also, I went through the code and it is still not clear what exactly …
-