-
I have Apache Solr 7.2 installed in SolrCloud mode. I have set up 4 collections to index HTML pages, images, and videos in separate collections. I was using Norconex to index the content from the web site…
-
I have 2 jobs, A and B. I already know B will fail during indexing (to Solr). When B inevitably fails, A fails as well, provided A finishes _after_ B.
If I run A by itself, it completes successfu…
-
I am getting the following errors while using SQL Committer V2. Can you please help?
1. java.lang.ClassCastException: java.lang.Long cannot be cast to java.lang.Integer
2. com.norconex.committer…
-
Is there any sample Java code/project available to call the crawler and Solr committer? We are developing a Java application and we would like to call the Norconex collector and committer from our j…
-
I noticed that the SourceForge account for JTS is still open. It appears above GitHub in some searches and continues to see a lot of downloads and occasional bug reports.
-
I have the following requirement:
1. Crawl all the pages under a given URL, not the entire domain
Example: http://www.paihotels.com/the-president-hotel-jayanagar-bangalore/
The crawler should …
-
I need to extract only certain types of files from a repository, for example .pdf, .ppt, ... I am using this configuration, but it does not work.
```xml
#set($http = "com.norconex.collect…
-
**Description:**
With the Publisher access control feature included, APIs created before the feature was added become invisible. This is due to making a Solr query which should have been done only if the pu…
-
**Description:**
Currently, building the product distribution changes the path at [1], but the proper fix is to change the product structure to contain the 'solr' directory.
[1] https:…
-
Hi!
I have started using this crawler and I'm trying to do something that I'm not sure is feasible.
I have several roles on the site, so I want to crawl the site with every role and commit the…