norconex-importer Search Results

413 results
for norconex-importer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Norconex/committer-elasticsearch #3

HTTP collector never exits when committing to elasticsearch

When committing to elasticsearch (see the below config), the `collector-http.sh` script never terminates even though the crawler run has already ended. I have to manually kill the process using `CTRL+…

niels updated 7 years ago
8
Norconex/importer #47

question - itemscope and itemtype

Using the technique in #44, I discovered I didn't need to do anything to extract schema.org metadata, because either Norconex importer or Tika will create metadata for objects within an itemscope. …

danizen updated 7 years ago
15
Norconex/importer #54

DOMSplitter that also extracts title

A selector for both content and title would be nice to have.

OkkeKlein updated 7 years ago
6
Norconex/crawlers #41

Please provide a sample setup to crawl a website and store t…

Please provide a sample setup to crawl a website and store the content in Solr repo. Also we have other requirements like, indexing Metadata, skip certain URLs, parsing only part of a content page and…

raviks007 updated 7 years ago
8
Norconex/importer #48

Boilerpipe usage on importer

hi there I am trying to figure it out how to use the Boilerpipe jar file, however I am not able to do it. could you please post some basic instructions or share with me an address ? thanks a lot

angelo337 updated 7 years ago
4
Norconex/crawlers #346

HTTP collector and solr.

Hi, can you help me? I try to run minimum example and get no errors but no data appear in the solr core. ``` :/opt/norconex-col$ ./collector-http.sh -a start -c examples/minimum/minimum-config.xml …

or-dos updated 7 years ago
4
Norconex/importer #53

TitleGeneratorTagger not detecting headers as expected

When the header contains a period in a domain name or has 2 sentences (2 periods or 1 period and question mark) followed by newlines it is not used as title.

OkkeKlein updated 7 years ago
6
Norconex/collector-filesystem #15

Nested fields

Hi, I am looking for a way to create a nested field with the following structure in my elastic search ingested documents: ``` color:{ type:"nested", properties:{ level:{type:"integer"}…

jmrichardson updated 7 years ago
4
Norconex/crawlers #374

Crawling only the first of multiple identical <section> elem…

Hi Pascal, I have a site with multiple identical ```` tags on the same level. The content that I want to parse is in the first one. How can I do that? Tried with combinations of StripAfterTrans…

sveba updated 7 years ago
3
Norconex/crawlers #338

Importer Handlers ignored

I´d like to setup a crawler to feed my Solr instances. This is my setup: # Configuration ## Solr https://github.com/Norconex/committer-solr/tree/master/norconex-committer-solr/src/test/java…

BorisGuenther updated 7 years ago
4

上一页 1...22 23 24 25 26 27 28...42 下一页

413 results for norconex-importer

413 results
for norconex-importer