tasfe / crawler4j

Automatically exported from code.google.com/p/crawler4j
0 stars 1 forks source link

Parsing of urls with # broken #224

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?

1. Call getCanonicalURL( "http://twitter.com/#!/digitalpixels" ) and the result 
is http://twitter.com

What is the expected output? What do you see instead?

Should be: http://twitter.com/#!/digitalpixels

What version of the product are you using?

3.5

Please provide any additional information below.

See: 

http://stackoverflow.com/questions/3009380/whats-the-shebang-hashbang-in-faceboo
k-and-new-twitter-urls-for

https://developers.google.com/webmasters/ajax-crawling/docs/getting-started

Original issue reported on code.google.com by mark.smu...@gmail.com on 7 Jun 2013 at 4:53

GoogleCodeExporter commented 9 years ago

Original comment by avrah...@gmail.com on 18 Aug 2014 at 3:40