issues
search
owner888
/
phpspider
《我用爬虫一天时间“偷了”知乎一百万用户,只为证明PHP是世界上最好的语言 》所使用的程序
3.49k
stars
1.18k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
文档里的config/inc_config.php中的配置怎么设置呢
#69
xiaochengfu
opened
7 years ago
2
没有返回数据
#68
githubyyf
opened
7 years ago
0
php版本为7.1.7,导出为csv文件时路径出错。
#67
serenehaly
closed
3 years ago
1
无法入库怎么解决,$spider->on_extract_field 输出不了任何东西, 系统走不到这个流程里?
#66
spc324
opened
7 years ago
1
修复selector::$dom_auth问题
#65
liesauer
closed
4 years ago
1
getcookie 函数失效
#64
beerkala
opened
7 years ago
2
selector 不支持xpath函数
#63
code-geeker
opened
7 years ago
1
遇到一个头疼的问题
#62
ijackwu
closed
7 years ago
1
小bug一枚,字段配置repeated,phpspider类中get_fields方法未做相应处理
#61
kvman1122
closed
7 years ago
1
field定义children子项如何保存到数据库?
#60
overlook940
closed
7 years ago
0
文档bug
#59
ijackwu
closed
7 years ago
2
content_url_regexes怎么写呢?
#58
czly
opened
7 years ago
3
请问如何使用多个代理ip采集数据?
#57
czly
opened
7 years ago
1
当field有子项的时候,有些问题
#56
iamMarkchu
opened
7 years ago
1
可以只采集列表页的数据吗?
#55
a362642675
opened
7 years ago
2
请问多线程的系统环境怎么配?
#54
i1219
opened
7 years ago
3
请问是否支持直接爬取 内容页url
#53
p0h5
opened
7 years ago
3
list_page是否可以只获取特定代码块的url
#52
chent1024
opened
7 years ago
1
爬取蚂蜂窝数据出问题
#51
callmeYe
opened
7 years ago
2
能加入对HTTPS的支持吗
#50
puluter
closed
7 years ago
0
有许多重复的数据
#49
fengshengxie
opened
7 years ago
2
添加完整的编码方式属性,DOMDocument::loadHTML才能正确识别UTF-8
#48
guotd
opened
7 years ago
1
spider能不能和workerman的异步任务结合起来?
#47
shuyabin
opened
7 years ago
2
明明log_show 是false,为什么还是能看到一大堆的log在滚动(源码说明一切 - -)
#46
xczizz
closed
7 years ago
2
使用 css 选择器的时候,可能会出错。
#45
xczizz
closed
7 years ago
2
是否考虑增加支持 composer ?
#44
dryyun
opened
7 years ago
1
彩蛋有毒啊...
#43
lynxcat
opened
7 years ago
6
为什么网址中的点不用转义?
#42
ksgujie
opened
7 years ago
0
修复以 ./ 开头的相对链接拼错的bug
#41
citywill
opened
7 years ago
0
怎么将url插入到fields中呢?
#40
citywill
opened
7 years ago
2
demo里面的例子多任务爬取的文件都运行不起来?
#39
linhanbo
opened
7 years ago
3
dom解析bug,<div class="example"></div>经过dom操作再获取会变成<div class="example"/>
#38
GargantuaX
opened
7 years ago
1
爬虫如何做计划任务?
#37
zhangya4548
opened
7 years ago
3
弱弱的问一句,这个能采集js动态加载的内容吗
#36
linkkong
opened
7 years ago
2
Multitasking needs Redis support, Error: The redis extension was not found
#35
pengbo37877
opened
7 years ago
1
PHP Fatal error: Allowed memory size of 1073741824 bytes exhausted
#34
greatken999
opened
7 years ago
0
$conf配置children后,后续规则失效 bug,已定位
#33
GargantuaX
opened
7 years ago
1
文档set_hosts($hosts)有小错误
#32
CenJing
closed
7 years ago
2
log win下输出中文乱码,log不能设置是否输出到文件
#31
geri5
opened
7 years ago
1
cls_curl有用到么?
#30
geri5
opened
7 years ago
2
教程里第一个demo是不能运行的
#29
ufoe
opened
7 years ago
3
马蜂窝的数据采集不了了,因为马蜂窝网站改规则了
#28
ufoe
opened
7 years ago
0
好厉害
#27
DavisZhang2014
opened
7 years ago
0
作者有点调皮,ʅ(´◔౪◔)ʃ
#26
itcats
opened
7 years ago
3
如何下载网页内容的时候,将图片或者附件一起下载到本地
#25
allsapbooks
opened
7 years ago
1
fix requests::set_cookies function
#24
kwxiaozhu
opened
7 years ago
1
使用demo里面的例子时获取的数据装不进数据库
#23
JohnChongJC
opened
7 years ago
0
现在这个git无法爬去知乎代码?
#22
locxiang
opened
7 years ago
0
requests类设置随即伪造ip的参数有误
#21
SyanH
opened
7 years ago
1
Ctrl-C无法终止
#20
cuikangyi
closed
7 years ago
3
Previous
Next