norconex-importer Search Results

414 results
for norconex-importer

Norconex/crawlers #623

How to get page full page content?

I need to get the full page content (with HTML tags) in my committer. How can I do this? For now I am getting just text, without HTML tags and other information. Maybe there exists some class which provides that…

Tsyklop updated 4 years ago
1
Norconex/crawlers #441

DOMTagger only for html but also index PDF and other Documen…

Hi, currently i have the situation that i want to only have the "main" content parsed in an html document. Like this: ```xml text/html ``…

tschechniker updated 4 years ago
1
Norconex/crawlers #659

Is it possible to store list of object in metadata?

I have List of object like this : ``` java { "url": "http:www.example.com/url1", "class": "classA" }, { "url": "http:www.example.com/url2", "class": "classB" }, { "url": "http:www.example.com/…

LeMoussel updated 4 years ago
4
Norconex/crawlers #646

canonicalLinkDetector rejects pages when <link rel="canonica…

Our website contains pages with ``, where "foo" is the url of the page itself. HTTP Collector 2.8.0 seems to reject these pages erroneously. I made this minimal config: ```xml /home/ron…

ronjakoi updated 4 years ago
7
Norconex/crawlers #454

Restricting the url extraction on limited date

First, I want to thank you for this great collector. Second, I want to know whether there is a filtering criterion to filter extracted URLs by date, or to stop extraction when a given date is reached. What I see now is th…

AnwaarAshour updated 4 years ago
6
Norconex/collector-filesystem #24

Contents under zip are marked as deleted on the second run, …

Hi, I split the XML files generated for the zip file using ` application/zip `. However, when I ran the program again, in which the zip file was already exi…

jayjamba updated 4 years ago
17
Norconex/crawlers #648

Form Auth failing with handshake failure

This is the latest stable release afaik. My config, not sure if I need these as well... ``` ./forescout/wiki-output/logs "FS HTTP Client" ht…

jacksonp2008 updated 5 years ago
1
Norconex/crawlers #640

custom 404 page content

Hi, we have a situation where instead of a default 404 error being returned, a custom page is rendered with a regular 200 status code. The only thing different is that the title has a 404 error in …

dtcyad1 updated 5 years ago
2
Norconex/collector-filesystem #47

File name having hash character ('#') in it is not crawled

Hi, I have put one simple text file with a hash (#) in its name, like ``#1.txt``, and when I try to crawl it using this path **smb://localhost/shared/test**, it's not getting crawled. And when I try to c…

jayjamba updated 4 years ago
12
Norconex/crawlers #610

How to effectively follow links; Extract contents out of bin…

Hello Pascal, I have a list of about 180000 user profiles, with each navigation page displaying 10 user profiles at a time, that I want to index. Each user list navigation page is tagged as noindex,…

joettt updated 4 years ago
5
