xrma / crawler4j

Automatically exported from code.google.com/p/crawler4j
0 stars 0 forks source link

link with space #173

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1.Crawl a site that contains a link like this: <a href="link with spache 
error.html">link with space</a>
2.
3.

What is the expected output? What do you see instead?
Expecting to see on shouldVisit, visit or handlePageStatusCode, but any of this 
handle the link.

What version of the product are you using?
3.3

Please provide any additional information below.

Original issue reported on code.google.com by li...@stoica.it on 17 Sep 2012 at 8:50

GoogleCodeExporter commented 9 years ago

Original comment by avrah...@gmail.com on 18 Aug 2014 at 3:27

GoogleCodeExporter commented 9 years ago
Checked it out and it works for me.

I have added a space before and after the link and it works.

Probably fixed since version 3.3

Crawler shows the link in shouldVisit and at visit

Original comment by avrah...@gmail.com on 21 Aug 2014 at 9:45