shadisabzali / dataparksearch

Automatically exported from code.google.com/p/dataparksearch
GNU General Public License v2.0

Indexer does not correctly handle absolute hrefs without a protocol specification #37

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago

1. Start indexing from http://www.yandex.ru/
2. The indexer tries to fetch http://www.yandex.ru/yandex.st/..., but the URLs 
in the page source look like <a href="//yandex.st/**"

It should replace //yandex.st/*** with http://yandex.st/***, where http is the 
current URL's protocol.
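
For illustration, here is a minimal Python sketch of the expected resolution. It is not the indexer's actual code: the helper name `resolve_href` and the path `/some/path.css` are hypothetical placeholders. An href starting with `//` is a network-path reference in RFC 3986 terms, so it should inherit only the scheme of the page being indexed.

```python
from urllib.parse import urljoin, urlsplit


def resolve_href(base_url: str, href: str) -> str:
    """Resolve an href found on the page at base_url (hypothetical helper)."""
    if href.startswith("//"):
        # Network-path reference: keep the href's own host and path,
        # and take only the scheme from the current URL.
        return urlsplit(base_url).scheme + ":" + href
    # Everything else follows ordinary relative-reference resolution.
    return urljoin(base_url, href)


# "/some/path.css" is a made-up path used only for this example.
fixed = resolve_href("http://www.yandex.ru/", "//yandex.st/some/path.css")
# Treating the same host as a plain relative path gives the URL the indexer fetched.
buggy = resolve_href("http://www.yandex.ru/", "yandex.st/some/path.css")
print(fixed)
print(buggy)
```

The explicit `//` branch is only there to make the rule visible. Python's `urljoin` already resolves such references the same way when used on its own.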

Original issue reported on code.google.com by b37hr3...@gmail.com on 6 Apr 2011 at 10:52

GoogleCodeExporter commented 9 years ago
This is fixed in the current SVN trunk and will be available in the next snapshot.

Thank you for reporting.

Original comment by dp.max...@gmail.com on 6 Apr 2011 at 1:12