CrawlScript / WebCollector

WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup a multi-threaded web crawler in less than 5 minutes.
https://github.com/CrawlScript/WebCollector
GNU General Public License v3.0
3.07k stars 1.45k forks source link

Exception when updating db, java.lang.InterruptedException,org.openqa.selenium.remote.UnreachableBrowserException: Error communicating with the remote browser. It may have died. #83

Closed mdzz9527 closed 6 years ago

mdzz9527 commented 6 years ago

爬虫过程中出现了这个问题,查到了和org.openqa.selenium.remote.UnreachableBrowserException: Error communicating with the remote browser. It may have died.有关的问题,但是也没有好的解决方法,说是和tcp的连接有关,可是我查了服务器的tcp连接没有关系的。如果你有见解的话烦请指点,谢谢!

hujunxianligong commented 6 years ago

已经添加了RocksDB内核,可以试试

mdzz9527 commented 6 years ago

谢谢,我试试

hujunxianligong commented 6 years ago

记得用2.72-beta版,选择rocks下面的BreadthCrawler