-
Hi ,
I am using HTTP collector and MYSQL committer along with document fetcher to crawl and index web pages. everything is working fine , however I have one requirement where I need to store start…
-
Related to a closed issue:
No matter what I try I keep getting the following exception:
Caused by: java.sql.SQLException: Incorrect table name 'document' Query: ALTER TABLE tablename ADD documen…
-
Hi Pascal,
it is so much fun to work with the Norconex Collector, thank you.
Well documented and easy fast results.
Is there a way to Split a metadata-attribute with multiple values into multip…
-
Hi Pascal,
I am seeing a lot of duplicates being processed. The number after the colon shows how many times i see it committed. This is all happening in the same run. How can I prevent this from ha…
-
Due to a bug in the shuyo detector sometimes the wrong language from the list of detected languages is chosen as `document.language`.
Test case:
```
#set($handler = "com.norconex.importer.handl…
-
I am getting the following errors while using SQL Committer V2. Can you please help?
1. java.lang.ClassCastException: java.lang.Long cannot be cast to java.lang.Integer
2. com.norconex.committer…
-
Hi Pascal,
I'm using 2.8.0 so if this issue has been addressed in 2.9.0, please let me know.
I'm trying to remove whitespace, new-lines, tabs from content. I found same/similar posting and trie…
-
Hi,
I have requirement while loading the StartURls from a Text File , I need to pass additional parameter like URLID, URLName, URL and City to the collector which needs to be stored in MYSQL table …
-
I have the following requirement:
1. To Crawl all the pages in a given URL , not the entire domain
Example : http://www.paihotels.com/the-president-hotel-jayanagar-bangalore/
The crawler should …
-
Hi,
It seems there's a regression with HttpCollector 2.8.0 (not reproduced in 2.7.1).
An Ajax crawl throws Null Pointer Exception :
```
java.lang.NullPointerException
at com.norconex.collector.h…