norconex-importer Search Results

413 results
for norconex-importer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Norconex/collector-filesystem #49

Could not retreive SMB ACL data - ver 2.9.0 Snapshot

Got this error when the start path is network drive, eg. \\\\cap-index\c$\WISD\ using Branch: 2.9.0 snapshot. FYI, this error not found in 2.8.0 ``` FilesystemCrawler: 2019-05-17 11:25:55 ER…

truezjz updated 4 years ago
8
Norconex/importer #100

ExternalTransformer: INPUT gets corrupted

hello Pascal, I'd like to generate a thumbnail image for every incoming `document.contentFamily = image` using an `ExternalTransformer` script with ImageMagick tools. But it seems, the provided bi…

jetnet updated 4 years ago
5
Norconex/crawlers #583

question on Tagger

Hi Pascal, Is it possible to have the Copy Tagger used only if the toField is null or not present or just empty string. Case in point - i have a title that is null but i have the collector.referrer…

dtcyad1 updated 4 years ago
3
Norconex/crawlers #591

DOMSplitter

Using the dom splitter i extract ```` tags as new documents ```xml ``` and then i capture the attributes ```xml text/x-php text/plain …

niozasg updated 4 years ago
5
Norconex/crawlers #363

norconex-collector-http

shreya-singh-tech updated 4 years ago
26
Norconex/crawlers #361

(voluntary) Bad usage of "redirect" by website leads to no c…

Hi, I'm trying to crawl the following page: http://pubs.acs.org/doi/abs/10.1021/acschemneuro.7b00162 This page first redirects to: http://pubs.acs.org/doi/abs/10.1021/acschemneuro.7b00162?cookie…

liar666 updated 4 years ago
11
Norconex/crawlers #655

GenericMetadataFetcher & "HEAD - Method Not Allowed"

we crawl many sites using the same configuration templates and configured the `GenericMetadataFetcher` globaly for all sites. Some sites do not allow the `HEAD` request, and the crawler stops at the v…

jetnet updated 4 years ago
9
Norconex/crawlers #527

Canonical link handling when stayOnDomain=true

Hi, What is the expected behavior when you encounter a canonical link in a document which points to another domain, and you have stayOnDomain set to true? I'm seeing that the canonical link is fol…

github-il updated 4 years ago
3
Norconex/collector-filesystem #35

unnecessary extra title & dc:title named metadata are added

Hi, How can I suppress the title and dc:title added by norconex api from xml. I just want to include these tags which are derived from tika. Tika derived : `Competitive Landscape Overview for Fy…

jayjamba updated 4 years ago
7
Norconex/crawlers #441

DOMTagger only for html but also index PDF and other Documen…

Hi, currently i have the situation that i want to only have the "main" content parsed in an html document. Like this: ```xml text/html ``…

tschechniker updated 4 years ago
1

上一页 1...9 10 11 12 13 14 15...42 下一页

413 results for norconex-importer

413 results
for norconex-importer