-
```
What steps will reproduce the problem?
java.lang.NullPointerException
at edu.uci.ics.crawler4j.frontier.DocIDServer.getDocID(DocIDServer.java:70)
at edu.uci.ics.crawler4j.crawler.WebCrawle…
-
```
00:39:00,979 INFO ~ Indexing page
http://www.sagepub.com/books/Book235204?fs=1&sortBy=defaultPubDate%20desc&subjec
t=H00 [114]
java.lang.NullPointerException: charsetName
at java.lang.String…
-
```
Output.....
http://robpaveza.net/
Missing method .ctor in assembly /home/steven/Desktop/Debug/Abot.dll, type
System.Dynamic.ExpandoObject
Unhandled Exception: System.IO.FileNotFoundException: C…
-
```
We should add better hooks in the WebCrawler in which we could better control
various errors while crawling a certain URL.
```
Original issue reported on code.google.com by `avrah...@gmail.com` …
-
```
What steps will reproduce the problem?
1. Crawl the site www.imdb.com with the example from the site
2.
3.
What is the expected output? What do you see instead?
I should not see any errors. Inste…
-
```
While crawling the seed http://eventiesagre.it/ I obtain the internal error
reported below.
I guess the issue is due the crawler finds a URL without a final / .
Processing page: [http://eventies…
-
```
What steps will reproduce the problem?
Use webcrawler to a url as seed that is down. Now, for example,
http://jquery.com/ is not working and webcrawler is freeze if I use this page.
What is the…
-
```
What steps will reproduce the problem?
1. use an example of http://vimeo.com/search?q=lectures as seed url
What is the expected output? What do you see instead?
Links to pages that are of type a…
-
```
What steps will reproduce the problem?
1. Take the simple crawler example; remove all calls to controller.addSeed()
and replace with this one
controller.addSeed("http://dairymix.com/");
2. This …
-
```
What steps will reproduce the problem?
1. I am using a list of urls to start many concurrent web crawlers using your
libraries, I was very keen on synchronization issues and so on.
2. Run the cra…