-
Nutch is a webscraping tool; the goal here is to train it to gather some documents from the web, for storage in SOLR.
We should take good notes about how to use Nutch, and any observations about how…
-
Hi there.
This is great work. I have it working on Nutch-2.4 (Feb 2021).
Question: why would such an important plugin such as this not have been integrated into the 1.x stream?
Is there an…
-
-
-
```
if you want to use the plugin with the new version of nutch the extensionpoint
is missing.
Exception in thread "main" java.lang.RuntimeException: Plugin
(language-detector), extension point: or…
-
```
if you want to use the plugin with the new version of nutch the extensionpoint
is missing.
Exception in thread "main" java.lang.RuntimeException: Plugin
(language-detector), extension point: or…
-
Hi, I just installed this crawler and I'm having an issue. Testing the crawler with just one URL and it seems to get stuck on the nutch InjectorJob, nothing happens after the following:
```
[nutc…
-
```
What steps will reproduce the problem?
1.
2.
3.
What is the expected output? What do you see instead?
What version of the product are you using? On what operating system?
Please provide any a…
-
```
What steps will reproduce the problem?
1.
2.
3.
What is the expected output? What do you see instead?
What version of the product are you using? On what operating system?
Please provide any a…
-
```
if you want to use the plugin with the new version of nutch the extensionpoint
is missing.
Exception in thread "main" java.lang.RuntimeException: Plugin
(language-detector), extension point: or…