issues
search
code4craft
/
webmagic
A scalable web crawler framework for Java.
http://webmagic.io/
Apache License 2.0
11.44k
stars
4.18k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
如何保持cookie不过期?
#1130
getideas
opened
1 year ago
0
请教怎样控制爬虫延时或者暂停?
#1129
Mr-LiuDC
opened
1 year ago
3
请教怎样实现一个基于数据库的Scheduler?
#1128
Mr-LiuDC
opened
1 year ago
3
fix(sec): upgrade net.sourceforge.htmlcleaner:htmlcleaner to
#1127
dack-su
closed
1 year ago
0
[Snyk] Security upgrade net.sourceforge.htmlcleaner:htmlcleaner from 2.26 to 2.29
#1126
code4craft
closed
1 year ago
0
WebMagic Extension version 0.9.0 has CVE-2023-2976 vulnerability
#1125
JackLinkai
opened
1 year ago
0
HttpClientDownloader download error
#1124
mayn7z
closed
1 year ago
1
java.lang.OutOfMemoryError: Java heap space
#1123
meikey
opened
1 year ago
1
There's a code injection vulnerability of `us.codecraft.webmagic.downloader.PhantomJSDownloader`
#1122
LetianYuan
opened
1 year ago
0
[Snyk] Security upgrade org.seleniumhq.selenium:selenium-java from 3.141.59 to 4.0.0
#1121
code4craft
opened
1 year ago
0
当使用代理的时候,如果链接代理超时,最后使用Lister监听状态会被算进成功里面,而不是失败!
#1120
konglquan
opened
1 year ago
0
[Snyk] Security upgrade com.google.guava:guava from 31.1-jre to 32.0.0-jre
#1119
code4craft
closed
1 year ago
1
如何让多个spider顺序执行
#1118
getideas
opened
1 year ago
1
The domain webmagic.io has expired
#1116
x6770
closed
1 year ago
1
xpath匹配标签使用或后得到的结果集不是按顺序出现
#1115
wanygan83
opened
1 year ago
0
[Snyk] Security upgrade com.jayway.jsonpath:json-path from 2.7.0 to 2.8.0
#1114
snyk-bot
closed
1 year ago
0
爬Twitter遇到JavaScript 不可用问题。
#1113
wayss000
opened
1 year ago
1
Https绕过host检查
#1112
Tanky-Zhang
closed
1 year ago
1
webmagic0.75,0.8.0 RedisScheduler无法使用
#1111
Bertluo
opened
1 year ago
1
想要获取到最后一次重定向到的最终url
#1110
keatonLiu
opened
1 year ago
0
No appropriate protocol (protocol is disabled or cipher suites are inappropriate)
#1109
ChenSino
closed
1 year ago
1
向 webmagic-saxon 组件提供若干新 API,更优雅更灵活更强大
#1108
hooyantsing
closed
1 year ago
0
修复 HtmlCleaner 无法正常解析 tr 和 td 标签的问题
#1107
hooyantsing
closed
1 year ago
0
Can Playwright be supported
#1106
holmofy
opened
1 year ago
1
请求支持JQuery遍历API
#1105
w3l7
opened
1 year ago
0
待爬取的链接数正常,但爬取结束后的结果数和链接数不一致
#1104
w3l7
closed
1 year ago
5
可不可以和scrapy一样,对每个url定义不同的请求方式和参数等
#1103
keatonLiu
closed
1 year ago
0
如何在PageProcessor的process里面实现点击操作?
#1102
Mr-LiuDC
opened
1 year ago
1
使用setCharSet()后无法自动推测网页编码,导致网页乱码
#1101
keatonLiu
closed
1 year ago
6
有微信群吗?
#1100
byedo
opened
1 year ago
4
建议addTargetRequests方法支持所有Iterable<String>
#1099
keatonLiu
closed
1 year ago
0
Integrate URLFrontier as a backend for URL storage
#1098
jnioche
opened
2 years ago
0
javax.net.ssl.SSLHandshakeException: Received fatal alert: protocol_version
#1097
keatonLiu
closed
1 year ago
4
某些情况下爬虫会莫名其妙卡住不动,但状态是Running
#1096
keatonLiu
closed
2 years ago
3
fix(sec): upgrade com.fasterxml.jackson.core:jackson-databind to 2.14.0-rc1
#1095
pen4
closed
2 years ago
1
HttpClientDownloader不要捕获错误而是把错误抛出来
#1094
keatonLiu
opened
2 years ago
7
Processor中使用page.putField()保存对象数组,数据量较大时,没进Pipeline里
#1093
Golne
closed
2 years ago
1
CountableThreadPool 的意义何在
#1092
MaLuxray
opened
2 years ago
0
修改WebDriverPool源码指定ChromeOptions出现org.openqa.selenium.chrome.ChromeOptions.addArguments([Ljava/lang/String;)Lorg/openqa/selenium/chrome/ChromeOptions;
#1091
694475668
opened
2 years ago
3
core包与springboot冲突
#1090
MaLuxray
closed
2 years ago
1
[Snyk] Security upgrade com.fasterxml.jackson.core:jackson-databind from 2.13.4 to 2.13.4.2
#1089
code4craft
closed
2 years ago
0
[Snyk] Security upgrade us.codecraft:xsoup from 0.3.4 to 0.3.6
#1088
snyk-bot
closed
2 years ago
0
[Snyk] Security upgrade com.fasterxml.jackson.core:jackson-databind from 2.13.2.1 to 2.13.4
#1087
snyk-bot
closed
2 years ago
0
Enhance Jsoup could parse tr td tag directly
#1086
vioao
closed
2 years ago
0
Common downloader error process
#1085
vioao
closed
2 years ago
0
Common the downloader status process and pass error information when …
#1084
vioao
closed
2 years ago
0
Revert "Common the downloader status process and pass error information when …"
#1083
sutra
closed
2 years ago
0
Common the downloader status process and pass error information when …
#1082
vioao
closed
2 years ago
2
downloader 设置 proxy 与 site 设置 proxy 有区别吗?
#1081
nesteiner
opened
2 years ago
1
downloader 设置 proxy 与 site 设置 proxy 有区别吗?
#1080
nesteiner
opened
2 years ago
0
Previous
Next