waue0920 / crawlzilla_old

Automatically exported from code.google.com/p/crawlzilla
0 stars 0 forks source link

索引 #18

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1.
2.
3.

What is the expected output? What do you see instead?

What version of the product are you using? On what operating system?

Please provide any additional information below.

Original issue reported on code.google.com by cailing0...@gmail.com on 22 Mar 2013 at 6:54

GoogleCodeExporter commented 9 years ago
索引庫名稱

爬取狀態

爬取時間

索引庫状態error
google

error: hadoop dfs -mkdir /user/crawler/admin/google broken

0h:0m:19s

请問如何解决??
谢谢

Original comment by cailing0...@gmail.com on 22 Mar 2013 at 6:58

GoogleCodeExporter commented 9 years ago
看起來是 HDFS 沒有正常建立成功。
您可以用 crawler 身份,先建立 /user/crawler/ 的 HDFS 目錄
然後再回網頁執行爬取工作。

su - crawler
hadoop fs -mkdir /user/crawler/tmp

Original comment by jazzwang...@gmail.com on 19 Dec 2013 at 10:06