CrawlScript / WebCollector

WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.
https://github.com/CrawlScript/WebCollector
GNU General Public License v3.0
3.07k stars 1.45k forks source link

当我连续爬取时出现403?怎么解决 #77

Closed df8305909 closed 6 years ago

df8305909 commented 6 years ago

Server returned HTTP response code: 403 for URL:

hujunxianligong commented 6 years ago

这个是反爬虫的问题,上代理吧