-
Hi, I have this problem with my Collector Http 2.9.0 installation:
a) collector crawls with a 1 day delay
b) keepDownloads is false to save disk space
c) collector only crawls urls listed in a text …
-
We’re having a problem with special characters in the importer.
We use a ScriptTransformer to extract certain patterns from the content and insert the extracted values as metafields on the document…
-
When we use the idol-committer, it seems unable to extract information from documents smaller than around 8 KB.
For small documents neither metadata nor content is extracted, only the metadata represent…
-
Dear all,
I am running a connector instance configured with about 40 active crawlers. Sometimes I see this in the log:
java.io.FileNotFoundException: D:\connettori_norconex\norconex-collector-http-2.8.1\.…
-
Hi,
I want to collect pages from an RSS feed.
This is my crawler configuration, but it returns no results.
Please help me.
``` xml
./examples-output/minimum/progress
./examples-output/minimum/logs
4
1
-1
…
```
-
In the committer.commitBatch() function, I try to get the page's content to write it to a database.
``` java
public class CustomCommitter extends AbstractMappedCommitter {
...
@Override
protected void comm…
```
-
I would like to know whether collector-http can meet the requirements of the following task (I appreciate your time and read the documentation first, but could not find some details):
A set of hundreds of web …