norconex-importer Search Results

413 results
for norconex-importer


Norconex/importer #41

TikaException: TIKA-198

There are a lot of exceptions of this kind in the log file. ``` com.norconex.importer.parser.DocumentParserException: org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.ap…

aleha84 updated 5 years ago
13
Norconex/importer #92

Support for including external script files in ScriptTagger

The current way of using `ScriptTagger` is like this: ```xml ``` Some of my scripts are longer than a few lines, so I thought it would be nice to have them in separate files. I tried …

ronjakoi updated 5 years ago
1
Norconex/committer-sql #5

Incorrect String Value while committing to MYSQL

Dear Mr. Paul, I am getting the following error while committing to MYSQL 5.7 version > Caused by: java.sql.SQLException: Incorrect string value: '\xF0\x9F\x91\x89 I...' for column 'content' at row…

HappyCustomers updated 5 years ago
2
Norconex/crawlers #531

Ignoring Links

This project has been very helpful, but I've got a roadblock that I can't seem to get around. I've been able to configure the crawler to authenticate against a site and then begin to crawl. However,…

RBBuff updated 5 years ago
14
Norconex/crawlers #545

PhantomJs for Fetching Dynamic Data

Hi Pascal, Ref : Norconex/importer: Issue No Import only certain text from HTML file #87 (https://github.com/Norconex/importer/issues/87 ) Based on your advice on using PhantomJS for fetching dy…

HappyCustomers updated 5 years ago
11
Norconex/crawlers #571

Extract tags from a field using DOM tagger

Hi, I am splitting an HTML document using DOMSplitter with img selector to extract what is in tags. After that I am trying to get some attributes like "alt:" and "src:" from "content" field (where…

niozasg updated 5 years ago
5
Norconex/crawlers #560

"handshake_failure" alert Exception (raised again)

Why does this crawler configuration always return a "handshake_failure" alert and a java.net SSLHandshakeException? ``` https://sapp2.formalazio.it/sapp/login Mozilla/5.0 (Windows NT 6.…

ciroppina updated 5 years ago
2
Norconex/crawlers #557

Repeatable crawler runs

Dear Sirs, I want to configure my in order to crawl starturls every 30 minutes. I tried using both the tags and the tag, but when the crawler job ends, the connector terminates. I would expect i…

ciroppina updated 5 years ago
2
Norconex/crawlers #479

multiple index entries for the same url

Hello, I am using the Norconex collector 2.8.0 to crawl my web sites. It is a great product and thank you for making it available open source. I want to have just one case-insensitive entry…

SolSearch updated 5 years ago
15
Norconex/importer #93

Links appearing in pdf documents

Hi, When parsing pdf documents which contain hyperlinks, the links end up in the extracted content. I'm using http_collector 2.8.1 and a simple pdf document (created from word) which has the wor…

ghost updated 5 years ago
3
