-
_I do not own these comments, these were copied from my old Wordpress.com blog verbatim, in case it helps other readers._
Author: GM
Thank you for these tutorials. I had a hard time finding the …
-
Running 'docker-compose up' fails with:
ERROR: Error: image kinetic/nutch:latest not found
-
Sub tasks:
+ Define Scoring plugin interface
+ Port over Cosine Similarity from Nutch to Sparkler
+ Port over Naive Bayes Filter from Nutch to Sparkler
+ Integrate Domain Explorer code into this …
-
This is a big one, but it's possible that most of this crawler should be replaced with Apache Nutch or similar. I originally hacked this out as a proof-of-concept but as usual, it grew a bit from the…
-
Hi,
I'm using apache-nutch-1.8.
I have difficulties compiling your plugin as .job file.
I've already put your plugin(extracteded from zip file) inside $NUTCH_HOME/src/plugin/, but the building proces…
arkka updated
10 years ago
-
The Seed API was refactored according to https://issues.apache.org/jira/browse/NUTCH-2090.
The python code needs to be refactored to match the specifications of the new API.
Or we need backward comp…
-
Since nutch 2.0 the HtmlParserFilter was renamed to ParserFilter, the changes around ParseFilter are many, it would be nice to make filter-xpath Nutch 2.x compatible.
-
In http://www.apache.org/dist/lucene/ we have these folders:
```
[DIR] java/ 2017-02-14 08:33 -
[DIR] mahout/ 2015-02-17 20:27 -
[DIR] nutch/ …
-
To index files on our internal share, it would be nice to mount the directory in readonly mode and then plug the mounted directory into Nutch using a custom [Protocol](https://github.com/meltmedia/nut…
-
This is probably a problem with my setup rather than your plugin.
I have nutch-2.3.1 and have installed your plugin to get rid of a bunch of navigation elements, breadcrumbs, footers components fr…