-
Currently, we have the following inheritance chains:
- `BasicCrawler` -> `HttpCrawler`
- `BasicCrawler` -> `BeautifulSoupCrawler`
- `BasicCrawler` -> `PlaywrightCrawler`
- `BasicCrawler` -> `Par…
-
## Issue
URL Preview and Parsing does not currently function well.
## Proposed Solution
Implement parsing of Open Graph `` tags to facilitate auto fill of wish attributes. There may be an existin…
-
i got this issue while running the program
C:\WINDOWS\system32>facebook_page_crawler '827852074085717' 'second-app' 'appledaily.tw' '2018-03-08 14:05:00' '2018-03-08 15:00:00'-r yes
usage: faceboo…
-
# Overview
From the following PRs:
- https://github.com/zytedata/zyte-spider-templates/pull/41, https://github.com/zytedata/zyte-spider-templates/pull/50
- https://github.com/zytedata/zyte-spider…
-
## 下载课程大纲时出现编码错误
网站:中国大学MOOC
课程地址:https://www.icourse163.org/course/ZJU-93001?tid=1003013004
问题描述:
> 6.2 图的遍历(4小节共22:22)
> 【视频】6.2.1 图的遍历 - DFS (6:43)
> python.exe : Traceback (mos…
-
Chrome driver
-
Sometimes it runs, sometimes it doesn't.
```
[14:22:38] INFO::email_crawler - Crawling http://www.google.com.au/search?q=electrician&start=0
[14:22:39] ERROR::email_crawler - Exception at url: http:/…
ghost updated
6 years ago
-
## Description
This agentless crawler ecosystem looks very interesting. I would like to contribute to it but I'm new to both this project and to Python. Can someone familiar with this project add…
-
Of Python libraries that come to mind, `python-gnupg` has the nicest docstrings in my opinion. Poke around https://github.com/isislovecruft/python-gnupg/blob/master/gnupg/. You learn:
- All parameters…
-
I have just run your crawler trying to get smart contract source code from Etherscan, but received an error message. I noticed that the Etherscan is using Cloudflare for security purposes, making the …