-
It would be good to have a way to manage multiple browsers contexts, either defined at the start of the crawl (in settings, for instance) or created/destroyed during the crawl.
/cc @kalessin
-
页面之间的数据传递,有没有更简便的方法,二级页面只有一个数据,还要新建个Item类,能不能在一个Item类中实现
代码如下:
```python
import asyncio
import aiofiles
from ruia import AttrField, TextField, Item, Spider, Middleware, Request, Response
mid…
-
Every day a few times HASS loses ithe casambi light entities.
Reloading the integrations does not fix the problem.
The entity status is set on a exclamation mark with a rounded arrow cirkle.
I…
-
#!/usr/bin/env python
# _*_ coding: utf-8 _*_
# @Time : 2021/7/19 5:48 下午
# @Author : zhanglonglogn
# @Version:V 0.1
# @desc
from client_sdk_python import Web3, HTTPProvider, WebsocketProvider
…
-
python3.9 部分方法更改,可加入版本判断
spider.py
源函数
async def _cancel_tasks(self):
tasks = []
for task in asyncio.Task.all_tasks():
if task is not asyncio.tasks.Task.current…
-
```
import aiohttp
import asyncio
from ruia_pyppeteer import PyppeteerRequest as Request
async def request_example():
url = "https://deck.tk/07Pw8tfr"
params = {
'name': 'ru…
-
我参考的是 https://github.com/howie6879/ruia/blob/master/docs/en/tutorials/item.md 里的代码
```
import asyncio
from ruia import Item, TextField, AttrField
class PythonDocumentationItem(Item):
ti…
-
Hello!
I am facing this issue:
```py
Traceback (most recent call last):
File "c:\tools\anaconda3\envs\virtual_workspace\lib\site-packages\twisted\internet\defer.py", line 662, in _runCallbacks…
-
I leave my logs. Previously old versions that ultimately not working. After cleaning and reinstalling can not see entities and logs are saying I am using old version after installing v1.0.3?
```
2…
-
目前在测试添加代理ip的时候,只有第一次爬取会启用@middleware.request中间件,后续重试的时候就没有了,导致如果添加了代理ip,代理ip失效的情况下无法自动更换