norconex-importer Search Results

413 results for norconex-importer

Norconex/crawlers #752

custom metadata tag - google cloud search plugin

Hello, I am setting up Norconex with the Cloud Search plugin. I have had some success with a small test and am now attempting to set up for a full site. Standard tagging is working and gets much of the …

ericwhiteau updated 3 years ago
5
Norconex/crawlers #674

How to send docs to a temporary storage?

@LeMoussel @essiembre Thanks, I would be interested to see that as I might have to write a committer myself, as I have to find a way to send crawled docs to temporary storage for further processing wh…

essiembre updated 3 years ago
6
Norconex/crawlers #712

How To Reject URLs indexed before a $date

Hi Pascal, I have a requirement to exclude all URLs that were indexed before a certain date; I need to exclude them from the next crawl onward. Now I have an additional situation as well. I added Curren…

sudeshna-majumder updated 3 years ago
5
Norconex/commons-lang #13

ConcurrentModificationException

Hello Pascal, there are some errors from time to time while the HTTP crawler is running: Exception in thread "StreamConsumer-STDOUT" java.util.ConcurrentModificationException at ja…

jetnet updated 3 years ago
8
Norconex/crawlers #707

Reference filters

Hi Pascal, first off, thank you for the excellent software. I want to crawl a very large site (10M+ pages) and I want to avoid all the search query links (containing ?, multiple keywords) and all…

bmfirst updated 3 years ago
15
Norconex/committer-elasticsearch #41

ElasticSearch Committer Error

Running the latest 3.0.0 M1 with elasticsearch 5.0.0 m1. Per the doc, it seems like typename should be there: https://opensource.norconex.com/committers/elasticsearch/v4/configuration But it ma…

jacksonp2008 updated 3 years ago
14
Norconex/crawlers #509

sitemap.xml metadata

Does the collector add metadata (title, keywords, etc.) it finds in the sitemap.xml itself, or just metadata it finds inside the document?

cherlo updated 3 years ago
7
Norconex/crawlers #647

Communicating Norconex Committer failures

Hi, we have noticed that the Norconex committer sometimes fails to index a few documents for various reasons. These failures cannot be communicated back to the crawl, which updates the checksum and will not be p…

arunakanaparthy updated 3 years ago
3
Norconex/crawlers #751

Version 3: Error: HTTP content length exceeded 2097152 bytes

Hi there, I am using Version 3 (for testing) with Google Chrome as the HTTP fetcher. All works fine, but when documents exceed 2097152 bytes I get errors (full log below): io.netty.handler.codec.…

phpsyscoder updated 3 years ago
5
Norconex/crawlers #739

Crawler not pulling page content

The Crawler doesn't seem to be pulling content for one of my sites. I can see every other field in the data but not the content. The only material difference between this config and my other working…

jacksonp2008 updated 3 years ago
2
