-
I'm trying to figure out a way to route a crawled document to a specific Elasticsearch index based on the language of the document. I am using the `com.norconex.importer.handler.tagger.impl.LanguageTa…
-
Hello there!
I have been looking for a simple web crawler, found this project, and liked it very much. The problem is that I can't find any useful tutorials for dummies and don't know how to …
-
## Definition of Done:
- Each member provides a list of underlying technologies that we lack knowledge of, such as:
1. Elasticsearch
2. Scikit-learn
3. Bloomfilters
4. Tensorflow
5.…
-
Initially, this will just cover the Scala Stream Collector, Stream Enrich, and Kinesis Elasticsearch Sink.
The two possible approaches were:
1. Always deploy any commit to the "develop" branch
2. Alw…
-
When Kibana is running without Elasticsearch, requests to /app/kibana never receive a reply.
![image](https://cloud.githubusercontent.com/assets/3143860/15523559/ff793d8c-21e0-11e6-930f-2ee7754cdb3b.png)
jbudz updated
8 years ago
-
This ticket originated from https://github.com/Norconex/committer-elasticsearch/issues/3#issuecomment-191226573.
Because of `AbstractBatchCommitter` calling commit() for every batch, this eliminates …
-
Zipkin is not a foundation; it isn't even a single GitHub org. Zipkin spans several orgs on GitHub, with the most central one being here. It has stakeholders including those making zipkin emula…
-
From @bruce-genhot at https://github.com/Norconex/collector-http/issues/190#issuecomment-161634436:
> The elasticsearch committer library only works with elasticsearch 1.5, the latest version is 2.1…
-
Given the below robots.txt, a URL such as http://www.mascus.com/agriculture/used-combination-drills/tume-kl-2500/ckckapdt.html is incorrectly rejected.
```
User-agent: Mediapartners-Google
Disallow:
…
niels updated
8 years ago
-
Hi, I am trying to construct a collector and crawler programmatically in Java. Everything looks good, except that the content is missing. Is something missing?
``` java
public static void main(Str…