norconex-importer Search Results

414 results
for norconex-importer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Norconex/crawlers #118

`Read timed out` in some channels

Hey, i've an issue with Read time outs. For some channels it works perfectly but for others not. It only happen from time to time. I have no request in the log file for that time, so the crawler did …

schwipee updated 9 years ago
17
Norconex/crawlers #166

Allow changing character case of field names

_Request created from @comschmid comment in issue #163._ Allows to change the character case of field names, like `CharacterCaseTagger` does for field values.

essiembre updated 9 years ago
9
Norconex/crawlers #141

The crawler doesn't extract URLs from www.feccoo-extremadura…

Crawling `www.feccoo-extremadura.org`, I get just one document, the one for the domain, but navigating with a browser, it automatically converts to `www.feccoo-extremadura.org/ensenanzaextremadura` an…

csaezl updated 9 years ago
5
Norconex/crawlers #122

Fawlty links causes Norconex to throw pages away

I encountered a page where the link " was present. It is obviously a fawlty designed URL. However, when encountering this URL, Norconex discards the current page with it, throwing the following stack …

Betongsuggan updated 9 years ago
3
Norconex/crawlers #145

Processing pictures in a flickr site

Crawling a flickr site, say, `https://www.flickr.com/photos/gobiernoextremadura` with: ``` https://www\.flickr\.com/photos/gobiernoextremadura/.* ``` I only get 3 documents: ``` …

csaezl updated 9 years ago
23
Norconex/crawlers #100

Unable to find valid certification path to requested target

This error happens with the seed URL for the site, so no document in the site is processed. What can I do? ``` MC(crawler): 2015-05-05 18:57:27 ERROR - Cannot fetch sitemap: http://valitsus.ee/sitema…

csaezl updated 9 years ago
21
Norconex/crawlers #119

Javascript generated URLs

I tried to crawl a site and get following error in log: ``` site: 2015-06-10 21:38:12 DEBUG - ACCEPTED document reference. Reference=http://www.site.com/Projects/c2c/channel/images/'+L140413[1+Math.r…

AntonioAmore updated 9 years ago
6
Norconex/committer-core #10

AbstractFileQueueCommitter committing 1 file, but AbstractCr…

I found a random behavior in the Committer. Running a crawl against the same url will give different results. Here is a section of output when the behavior is correct ``` INFO - AbstractCrawler …

yvesnyc updated 9 years ago
4
Norconex/crawlers #108

Filter usage question

From @madsbrydegaard, moved from https://github.com/Norconex/collector-http/issues/48#issuecomment-101662531: I tried implementing the filter option: ``` .\bkeyword\b. ``` However pages witho…

essiembre updated 9 years ago
12
Norconex/crawlers #85

OOM with snapshot 2.2.0

Only seeing this in console, not log file. Occured since using snapshot. Exception in thread "pool-1-thread-2" java.lang.OutOfMemoryError: Java heap space at java.lang.AbstractStringBuilder.(…

OkkeKlein updated 9 years ago
9

上一页 1...34 35 36 37 38 39 40...42 下一页

414 results for norconex-importer

414 results
for norconex-importer