asepaprianto / crawler4j

Automatically exported from code.google.com/p/crawler4j

charsetName NullPointer exception #145

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
00:39:00,979 INFO  ~ Indexing page http://www.sagepub.com/books/Book235204?fs=1&sortBy=defaultPubDate%20desc&subject=H00 [114]
java.lang.NullPointerException: charsetName
    at java.lang.String.<init>(String.java:449)
    at java.lang.String.<init>(String.java:523)
    at edu.uci.ics.crawler4j.parser.Parser.parse(Parser.java:66)
    at edu.uci.ics.crawler4j.crawler.WebCrawler.processPage(WebCrawler.java:276)
    at edu.uci.ics.crawler4j.crawler.WebCrawler.run(WebCrawler.java:189)
    at java.lang.Thread.run(Thread.java:679)

Original issue reported on code.google.com by tahs...@trademango.com on 6 Apr 2012 at 12:42

GoogleCodeExporter commented 9 years ago
The attached patch should fix the above bug.
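
For readers without the attachment: the trace points at a `new String(bytes, charsetName)` call in `Parser.parse`, and that constructor throws a NullPointerException when `charsetName` is null. The following is only a minimal sketch of one way to guard against that, not the attached patch and not crawler4j code. The class `CharsetFallback`, the method `decode`, and the choice of UTF-8 as the fallback are all assumptions made for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration; not part of crawler4j or the patch.
class CharsetFallback {

    // Decodes raw page bytes, falling back to UTF-8 (an assumed default)
    // when the charset name is missing or not supported by this JVM.
    static String decode(byte[] content, String charsetName) {
        Charset charset = StandardCharsets.UTF_8;
        if (charsetName != null) {
            try {
                charset = Charset.forName(charsetName);
            } catch (IllegalArgumentException e) {
                // Covers both illegal and unsupported charset names;
                // keep the UTF-8 fallback.
            }
        }
        return new String(content, charset);
    }

    public static void main(String[] args) {
        byte[] body = "<html>ok</html>".getBytes(StandardCharsets.UTF_8);

        // Unguarded call: a null charset name triggers the NPE from the trace.
        try {
            new String(body, (String) null);
        } catch (NullPointerException | java.io.UnsupportedEncodingException e) {
            System.out.println("unguarded: " + e.getClass().getSimpleName());
        }

        // Guarded call: decodes with the fallback instead of throwing.
        System.out.println(decode(body, null));
    }
}
```

Whether UTF-8 is the right default for pages that declare no charset is a judgment call; a crawler might prefer to detect the encoding from the content instead.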

Original comment by tahs...@trademango.com on 6 Apr 2012 at 12:52

GoogleCodeExporter commented 9 years ago
Hi,

Is there any reason this hasn't been integrated into trunk?
I can also reproduce the NPE when crawling the following URL:
http://people.apache.org/~lewismc/rover-test1.nq

This looks like a nice clean patch.

Original comment by lewis.mc...@gmail.com on 11 May 2012 at 11:37

GoogleCodeExporter commented 9 years ago
Hey, can someone please integrate the patch into trunk?

Original comment by ashwa...@indix.com on 1 Apr 2013 at 10:54

GoogleCodeExporter commented 9 years ago
Integrated in this change:
https://code.google.com/p/crawler4j/source/detail?r=e14a296409390eaba34108481b2ce779e0d99bbf

Original comment by ganjisaffar@gmail.com on 8 Apr 2013 at 4:28