-
Using the stripBetween transformer to delete headers and footers from documents in preParseHandlers
For most documents this works fine, but on some specific pages the footer is not removed. On all pages the markup…
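A common cause of "works on most pages, not on some" is markup that varies slightly between pages (extra attributes, whitespace, or casing), so the configured start/end strings no longer match. A minimal sketch of the handler in question, assuming Norconex 2.x and hypothetical marker strings (adjust `start`/`end` to what actually appears in your pages):

```xml
<importer>
  <preParseHandlers>
    <!-- Strips everything between the markers, including the markers
         themselves (inclusive="true"); case-insensitive matching helps
         when the markup varies in casing between pages. -->
    <transformer class="com.norconex.importer.handler.transformer.impl.StripBetweenTransformer"
        inclusive="true" caseSensitive="false">
      <stripBetween>
        <start><![CDATA[<div id="footer">]]></start>
        <end><![CDATA[</div>]]></end>
      </stripBetween>
    </transformer>
  </preParseHandlers>
</importer>
```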
-
I have a database of URLs relevant to one or more health topics. I am indexing these existing health topics, for which I've written:
* A URL provider that returns them from a database
* A tagger …
-
Hi, I have ~4.7M files already indexed and re-ran the crawler to see how long a second crawl would take. The first (initial) crawl took 1 day 10 hours. The second attempt I started la…
-
Hello all,
While crawling a huge website, I would sometimes run into trouble with the ID of my document being too large (in the case of CloudSearch, for example).
I wanted to know if it's pos…
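One workaround sketch, assuming Norconex 2.x (the field name is illustrative and the committer class is elided): generate a short, fixed-length identifier with `UUIDTagger` and point the committer's `sourceReferenceField` at it, so the target ID is the UUID rather than the possibly very long URL:

```xml
<importer>
  <postParseHandlers>
    <!-- Adds a fixed-length UUID to every document under "generated_id". -->
    <tagger class="com.norconex.importer.handler.tagger.impl.UUIDTagger"
        field="generated_id" overwrite="true" />
  </postParseHandlers>
</importer>

<committer class="...">
  <!-- Use the short UUID as the document ID instead of the URL. -->
  <sourceReferenceField keep="false">generated_id</sourceReferenceField>
</committer>
```

One caveat: a random UUID changes on every crawl, so incremental updates and deletions will not line up with previously committed documents; a deterministic hash of the URL would be safer for recurring crawls.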
-
I'm attempting to crawl a password protected wiki that we use for internal documentation and I'm struggling with getting authentication to work. I've tried to use form authentication as well as basic…
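For reference, a form-authentication sketch using the 2.x `GenericHttpClientFactory` (the URL and form field names below are hypothetical; they must match the wiki's actual login form):

```xml
<httpClientFactory class="com.norconex.collector.http.client.impl.GenericHttpClientFactory">
  <!-- "form" posts the credentials to authURL before crawling starts. -->
  <authMethod>form</authMethod>
  <authURL>https://wiki.example.com/login</authURL>
  <authUsernameField>username</authUsernameField>
  <authUsername>crawler</authUsername>
  <authPasswordField>password</authPasswordField>
  <authPassword>secret</authPassword>
</httpClientFactory>
```

A frequent gotcha is the field names: they must be the `name` attributes of the HTML `<input>` elements on the login page, not their visible labels.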
-
I have 4 requirements to configure specific aspects of ES:
1. Set field limit to 2000
2. Create custom analyzers and tokenizers
3. Create nested fields
4. Set specific field properties (document…
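All four requirements are Elasticsearch index settings/mappings rather than crawler settings, so they are typically applied when creating the index, before the committer first writes to it. A sketch of an index-creation body covering each point (field, analyzer, and tokenizer names are illustrative; exact syntax depends on your ES version):

```json
{
  "settings": {
    "index.mapping.total_fields.limit": 2000,
    "analysis": {
      "analyzer": {
        "my_analyzer": {
          "type": "custom",
          "tokenizer": "my_tokenizer",
          "filter": ["lowercase"]
        }
      },
      "tokenizer": {
        "my_tokenizer": { "type": "ngram", "min_gram": 3, "max_gram": 4 }
      }
    }
  },
  "mappings": {
    "properties": {
      "title":   { "type": "text", "analyzer": "my_analyzer" },
      "authors": {
        "type": "nested",
        "properties": { "name": { "type": "keyword" } }
      }
    }
  }
}
```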
-
Hi Pascal,
I was taking a look at the XMLFileCommitter in this webpage: https://www.norconex.com/collectors/committer-core/latest/apidocs/com/norconex/committer/core/impl/XMLFileCommitter.html
a…
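For context, `XMLFileCommitter` writes additions and deletions to local XML files instead of a remote repository, which is handy for inspecting what the crawler would commit. A minimal configuration sketch (the directory is illustrative):

```xml
<committer class="com.norconex.committer.core.impl.XMLFileCommitter">
  <!-- Where the generated XML files are written. -->
  <directory>/tmp/committed-xml</directory>
</committer>
```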
-
Hello community,
I'm new to Norconex and ended up doing this to try to optimize my website-crawling scenario:
```
java -server -Xms2048m -Xmx2048m -XX:NewSize=512m -XX:MaxNewSize=512m -XX:P…
-
I'm using the Norconex crawler on the Facebook Graph API /events/ endpoint and it is crawling the data, but when it commits it to Elasticsearch, Kibana sees the data as one block, so it cannot "index" it.
As I kn…
-
I'm very new to Norconex and am trying to configure it to crawl a site and add it to an existing Solr index. I've got a lot of issues, but I'll start with this one. When I run the crawler, it is inclu…
dkh7m updated 7 years ago