songgeb
/
BDIndexSpider
A Baidu Index scraping tool based on WebDriver. The source is open to share one approach to scraping Baidu Index.
https://songgeb.github.io/2017/01/29/%E7%99%BE%E5%BA%A6%E6%8C%87%E6%95%B0%E7%88%AC%E5%8F%96%E5%B7%A5%E5%85%B7/
83
stars
23
forks
issues
After a successful Baidu account login, no keyword is ever entered in the search box, and initialization fails
#21
GUOxingyu3018
closed
5 years ago
0
Retries endlessly and reports an error
#20
rowankid
opened
5 years ago
1
The new version hangs when entering the account and password
#19
lucifess
opened
5 years ago
0
Error when used on November 16: NoSuchElementException: Cannot locate an element using By.id: TANGRAM__PSP_4__userName
#18
I321098
closed
5 years ago
2
When scraping by province, six months of data are often missing
#17
MinardWu
opened
6 years ago
5
Why does it always scrape "nba" automatically?
#16
shiyishiaa
closed
6 years ago
4
Cannot recognize the digits in the image
#15
GUOxingyu3018
closed
6 years ago
4
Support newer versions of the Chrome browser
#14
songgeb
closed
6 years ago
1
The Baidu Index page has been updated, causing the tool to malfunction
#13
songgeb
closed
6 years ago
1
Feature request: search keywords by region
#12
johnqoe
closed
6 years ago
1
Login sometimes requires an SMS verification code
#11
songgeb
opened
6 years ago
7
Cannot scrape data for the current month; for example, when the current month is April, April's data cannot be scraped
#10
songgeb
closed
6 years ago
1
On Windows 10, the OCR recognition rate for captured images is low
#9
songgeb
closed
6 years ago
3
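Optimize the scraping strategy: instead of using WebDriver to render the index and take screenshots, download the images directly, then crop and stitch them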
#8
songgeb
opened
6 years ago
0
Implement our own OCR code to remove the requirement to install tesseract
#7
songgeb
closed
6 years ago
1
A small number of digits in the captured images are incorrect
#6
songgeb
closed
6 years ago
1
Importing a keyword file fails when the file contains multiple keywords
#5
songgeb
closed
6 years ago
1
The program does not exit automatically after scraping finishes
#4
songgeb
closed
6 years ago
1
Although scraping efficiency has improved, images with no index data are often captured
#3
songgeb
closed
6 years ago
1
Frequent scraping causes Baidu Index to restrict the account's access
#2
songgeb
opened
6 years ago
16
When running the jar file standalone, importing a keyword file reports the error "URI is not hierarchical"
#1
songgeb
closed
6 years ago
1