norconex-importer Search Results

413 results
for norconex-importer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Norconex/importer #38

new DOMTagger's "defaultValue not working?

Hi, I'm trying to use the new feature to assign a new value in case no match is found, But I can seems to get it to work. I've got a general tagger that extract the part of the page with members of …

liar666 updated 7 years ago
2
Norconex/crawlers #299

Dynamic committer routing based on language

I'm trying to figure out a way to route a crawled document to a specific Elasticsearch index based on the language of the document. I am using the `com.norconex.importer.handler.tagger.impl.LanguageTa…

zdrd updated 8 years ago
5
Norconex/importer #29

Bug un ReplaceTagger?

Hi, For a given crawler, I extract/tag a field EXP_NAME+COUNTRY that contains both the name and the country of an author (in the format "firstname other-names lastname [CountryCode]"). Thanks to a R…

liar666 updated 8 years ago
3
Norconex/importer #24

[DOMSplitter] StackOverflow with norconex-importer 2.5.2

with the following configuration (crawling depth 0): ``` [...] ``` I get a StackOverflowError : _java.lang.StackOverflowError at java.io.UnixFileSystem.getB…

sylvainroussy updated 8 years ago
2
Norconex/crawlers #286

Java exception: NoSuchMethodError when running minimum test

I've downloaded the software onto an Ubuntu 14.04 system with this java: java version "1.7.0_111" OpenJDK Runtime Environment (IcedTea 2.6.7) (7u111-2.6.7-0ubuntu0.14.04.3) OpenJDK 64-Bit Server VM (…

pcolmer updated 8 years ago
4
Norconex/importer #34

using ScriptTransformer with a tagged field

hi there I am having an issue with a transformation I am trying to put in place after I capture some information from the content field like this: ``` xml ((([U|u](NION|nion)\s[T|t](EMPORAL|empor…

angelo337 updated 8 years ago
2
Norconex/crawlers #250

I am trying to configure its importer module to strip what's…

I am trying to use Norconex HTTP Collector to configure its importer module to strip what's between headers, rightnavs and footers but is does not seem to be stripping what is between these known tag…

mitchelljj updated 8 years ago
9
Norconex/crawlers #283

Running multiple committers at the same time not possible?

Hi, I'm currently developing my own committer. In order to debug my own code, I wanted to keep the FileSystemCommitter , so that I can compare the output of both committer. The configuration file lo…

liar666 updated 8 years ago
5
Norconex/crawlers #267

OOM crawling PDF.

DEBUG [CachedInputStream] Deleted cache file: /tmp/CachedInputStream-4437588446019026996-temp Exception in thread "pool-1-thread-1" INFO [AbstractCrawler] My Crawler Name: Deleting orphan references …

OkkeKlein updated 8 years ago
9
Norconex/crawlers #281

no crawling occur if the URL was direct to another page

I used to insert URL without writing the home page, as an example: ` https://en.wikipedia.org/ ` instead doing ` https://en.wikipedia.org/wiki/Main_Page ` in first case it doesn't crawl, but in the…

doaa-khaled updated 8 years ago
5

上一页 1...27 28 29 30 31 32 33...42 下一页

413 results for norconex-importer

413 results
for norconex-importer