-
Create a tagger that allows extracting both the field names and field values as pairs.
The following is a use case describing the requirement, reported here: https://github.com/Norconex/collector-…
-
I originally wrote a crawler with:
```
https://(find|openresearch-repository|digitalcollections)[.]anu[.]edu[.]au/(handle/[0-9]+/1/)simple-search([?].*)?
```
Running…
-
I am trying to set up a norconex connector for a site
and my issue is that the URLs under the div portion is not getting crawled.
Attaching the configuration code here:-
```xml
#set($http…
-
-
[meta.txt](https://github.com/Norconex/collector-http/files/1229822/meta.txt)
[config.txt](https://github.com/Norconex/collector-http/files/1229819/config.txt)
Hi Pascal, I have a question abo…
-
hi there
I have a question regarding the importer, Is it possible to limit the content size of a File, I am having issues with a some large files in MS-Excel, and I would like to just index a couple …
-
Hi there
i am trying to crawl a website with several file types and I have to strips before and after, and when I hit some file not application/HTML I am getting an error, is it possible to apply st…
-
Hi,
The `jsoup` api version 1.7.2 (pulled from `tika-parsers` dependency) causes NoSuchMethodError (but works well with `jsoup` 1.9.1) :
```
ERROR - AbstractCrawler - 3_3: Could not pr…
-
Hi,
I've written a quite complicated crawler which shows a strange behaviour. I've tried to reduce its size to make it more readable. You'll find its code attached.
The problem is the following:…
-
I'm getting the following error when I run my committer code:
Caused by: org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/uvahea…
dkh7m updated
7 years ago