omprakashrathi / crawler4j

Automatically exported from code.google.com/p/crawler4j
0 stars 0 forks source link

NullPointerException when trying to crawl different URLs #299

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Crawl one of the following:
http://lincoln.pioneer.kohalibrary.com/cgi-bin/koha/opac-search.pl?do=Search&idx
=isbn&q=0600313069

http://www.linkcat.info/ipac20/ipac.jsp?index=CISBN&term=8423997235

http://oasis.unisa.ac.za/search/i?SEARCH=9781853673801&searchscope=1

And many others cause NPE. 

and are skipped!

Original issue reported on code.google.com by avrah...@gmail.com on 1 Sep 2014 at 11:56

GoogleCodeExporter commented 9 years ago
Fixed in revision: 3aa3cd82b723 

It was caused due to encoding forcing of GZip !?

Original comment by avrah...@gmail.com on 1 Sep 2014 at 11:58