norconex-importer Search Results

413 results
for norconex-importer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Norconex/crawlers #446

Cannot fetch document: Handshake Failure

I have the exact same configuration for another domain, just swapping out the domain specific stuff. I get the following error when trying to run the indexer. ERROR [GenericDocumentFetcher] Cannot…

jamieshiz updated 5 years ago
7
Norconex/importer #86

preParse / postParse script filter weirdness

Hello Pascal, I'm struggling with a weird script filter issue. Even the most trivial script filter just returning `true` results in a `REJECTED_IMPORT` when used as `postParseHandler`, where the…

Pittiplatsch updated 5 years ago
7
github/securitylab #38

Java (Maven): Actually fix the use of insecure protocol to d…

[![mitm_build](https://user-images.githubusercontent.com/1323708/59226671-90645200-8ba1-11e9-8ab3-39292bef99e9.jpeg)](https://medium.com/@jonathan.leitschuh/want-to-take-over-the-java-ecosystem-all-yo…

JLLeitschuh updated 4 years ago
10
Norconex/crawlers #554

Connection Refused when trying to crawl a certain website

Dear Sirs, I am successfully crawling dozen websites with the 2.8.1 Collector-Http, successfully sending/committing contents to my Solr7.5.0 schema But a (Italian) website always returns Connect…

ciroppina updated 5 years ago
4
Norconex/importer #88

Change content field based on contentType

Hi there, I'm trying to change the content that will be committed based on the contentType. That means, I'm trying to submit the "description"-Field for PDF files, rather than the original content…

mnemonictrick updated 5 years ago
3
Norconex/crawlers #541

Multiple start URLs

Hi, From my main site http://www.test.com, i only need to crawl certain parts. So I configured like this: ```xml https://www.test.com/deptA https://www.test.com/deptD ``…

dtcyad1 updated 5 years ago
5
Norconex/crawlers #540

URL interpret issue

Hi, I have several urls with this format. Is the hypen causing the issue? How do i get around it? I cannot change the url formats. ``` website_test: 2018-12-02 18:56:08 INFO - DOCUMENT_F…

dtcyad1 updated 5 years ago
9
Norconex/importer #87

Import only certain text from HTML file

Hi, I want to import only certain data from the webpage which I am crawling. This data exists between the body tag of the HTML page ```xml The ACME Business Bangalore www.acme.com …

HappyCustomers updated 5 years ago
7
Norconex/crawlers #537

Append or suffix string to value of metadata

Hi, I want to suffix the value of one metadata field. I tried using com.norconex.importer.handler.transformer.impl.ReplaceTransformer in the preParseHandlers. For ex. title="test" to get transform…

kristiWabion updated 5 years ago
1
Norconex/crawlers #514

How to store Start URLs

Hi , I am using HTTP collector and MYSQL committer along with document fetcher to crawl and index web pages. everything is working fine , however I have one requirement where I need to store start…

HappyCustomers updated 5 years ago
12

上一页 1...13 14 15 16 17 18 19...42 下一页

413 results for norconex-importer

413 results
for norconex-importer