norconex-importer Search Results

414 results for norconex-importer

Norconex/importer #4

Irregular behaviour of TextBetweenTagger

I'm using TextBetweenTagger in order to acquire HTML code from crawled pages. The configuration looks like: ``` ^.* .*$ ``` However, this has pu…

Betongsuggan updated 9 years ago
6
Norconex/importer #2

Importer Fork

Hello, I plan to create a copy of the existing importer that contains some additional specific functionality. It is possible I misunderstood all of the importer's configuration options and am creating a duplicate: …

AntonioAmore updated 9 years ago
18
Norconex/crawlers #46

Custom Committers and version 2.*

I just checked the documentation and can't find any link or text that may help me refactor the existing committer. It is possible to use Importer also (I agree - a dirty hack), I guess, but the l…

AntonioAmore updated 9 years ago
5
Norconex/crawlers #45

Trying to start 2.0.1 with old configs.

Hello! Haven't visited you for a long time :) I cleaned workdir and tried to launch the collector via command line getting such exceptions: ``` WARN [ConfigurationUtil] Could not instantiate objec…

AntonioAmore updated 9 years ago
5
Norconex/crawlers #47

Possible temporary working directories bug

I have the following config for a crawler: ``` #set($http = "com.norconex.collector.http") #set($core = "com.norconex.collector.core") #set($urlNormalizer = "${http}.url.impl.GenericURL…

AntonioAmore updated 9 years ago
2
Norconex/crawlers #54

filling Solr "content" field

In collector http configuration file I have the sentence: text In Solr, "text" field is defined as: indexed="true" stored="false" On the other hand I'd need to use Solr "content" field (indexe…

csaezl updated 9 years ago
12
Norconex/crawlers #43

1.34 to 2.0 Conversion

I moved my configuration over from 1.34 to 2.0 and I receive the following error: ERROR com.norconex.importer.Importer - Unsupported Import Handler: null Is there additional configuration that I nee…

jeffcheney updated 9 years ago
5
Norconex/crawlers #42

Java application for crawling purpose with collector-http

Hi, I need to use collector-http to get data from several sites which fulfill some regular expression and store them in a database via Java application. Is this possible with collector-http, and how …

comcrawler updated 9 years ago
4
Norconex/crawlers #23

Make all in- links metadata and anchor texts accessible from…

Is it possible to retrieve the anchor text and metadata of all crawled links pointing to one crawled document? The problem I'm facing is setting a readable name on crawled document and the only human…

leonardsaers updated 9 years ago
6
Norconex/committer-core #3

Custom MySQL commiter implementation

Hello! I am writing my own committer implementation to put collected pages into a MySQL database. I've taken SolrCommiter as an example - is that the right decision? So I inherited from AbstractMappedCommitt…

AntonioAmore updated 10 years ago
5
