当我连续爬取时出现403？怎么解决

CrawlScript / WebCollector

WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.

https://github.com/CrawlScript/WebCollector

GNU General Public License v3.0

3.07k stars 1.45k forks source link

当我连续爬取时出现403？怎么解决 #77

Closed df8305909 closed 6 years ago

df8305909 commented 6 years ago

Server returned HTTP response code: 403 for URL:

hujunxianligong commented 6 years ago

这个是反爬虫的问题，上代理吧