-
I am experiencing a strange problem with the HTTP Collector v3 RC1, which could be a bug.
This is an example config based on the minimal setup included in the examples folder of Norconex v3 RC1:
…
-
It would be great to do some extra work upon deletion process
![image](https://user-images.githubusercontent.com/15138506/75683302-c47f2600-5c8e-11ea-938e-cf0ccece3e6f.png)
Thanks a lot!
-
Hi,
I don't get the ReplaceTransformer to work in the Norconex 3.0.0-SNAPSHOT (2021-12-20). Either I am missing something in the configuration or it just does not have any effect on the content fie…
-
Hi,
We have noticed that sometimes Norconex committer fails to index few documents for any reason. These failures cannot be communicated back to the crawl which updates the checksum and will not be p…
-
Hello,
I am crawling a website, where some entries in the sitemap will have images like so:
```
https://example.com/about
2021-01-28T16:11:08+01:00
weekly
0.7
…
-
In https://github.com/Norconex/collector-http/issues/477, I diagnosed a serial problem where my crawling job experienced a Fatal `OutOfMemoryError`, and then later an attempt to stop the collector fai…
-
Hi All,
I have Norconex HTTP collector configured with Solr Committer. But I updated standalone solr with solr cloud and not sure how I can connect? Below is the xml file where solr committer is co…
-
@LeMoussel @essiembre Thanks, I would be interested to see that as I might have to write a committer myself, as I have to find a way to send crawled docs to temporary storage for further processing wh…
-
Hello
We are using norconex since months but since 1 Week we are not able to to commit the documents...
We are seeing 2 errors in the logs first one is a certificate issue and the second one is …
-
On a site with sitemap path specified in robots.txt Norconex doesn't recognize this specification.
The SitemapResolverFactory is configured to respect only specifications from robots.txt by setting t…