Maxime2 / dataparksearch

An open source, feature rich search engine.
http://www.dataparksearch.org/
GNU General Public License v2.0
95 stars 45 forks

IPv6 URL support #3

Open. magik6k opened this issue 9 years ago

magik6k commented 9 years ago

I am using this engine to index a moderate-size IPv6-only network (through a Squid proxy) where some services don't have domain names and instead use raw IPv6 URLs, which look like http://[2001:db8:85a3:8d3:1319:8a2e:370:7348]/. The problem is that DataparkSearch seems to cut the URL after the first ":" character. This issue is only about IPv6 URL support, not full IPv6 support.

Here's how the URLs look in the database: [image: Database view]

Maxime2 commented 9 years ago

DataparkSearch Engine doesn't support IPv6 at all, so it treats a colon (":") in a URL as the separator between the host name and the port number. I agree it would be better to add support for literal IPv6 addresses, since such pages could be indexed via a proxy even though DataparkSearch cannot crawl them directly. However, I cannot say when such support would be implemented, sorry.
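
For illustration only, here is a minimal Python sketch of the behaviour described above and of a bracket-aware alternative. It is not DataparkSearch's actual C code; the function names `split_host_port_naive` and `split_host_port` are hypothetical, and the sketch assumes the authority part of the URL (everything between "http://" and the next "/") has already been extracted.

```python
def split_host_port_naive(authority):
    """Split at the first ':' (the behaviour reported in this issue)."""
    host, _, port = authority.partition(":")
    return host, (port or None)


def split_host_port(authority):
    """Split host and port, treating a leading '[...]' as an IPv6 literal."""
    if authority.startswith("["):
        end = authority.find("]")
        if end == -1:
            raise ValueError("unterminated IPv6 literal: %r" % authority)
        host = authority[1:end]        # address without the brackets
        rest = authority[end + 1:]     # either "" or ":port"
        if rest and not rest.startswith(":"):
            raise ValueError("unexpected text after ']': %r" % authority)
        return host, (rest[1:] or None)
    # No brackets: a host name or IPv4 address, so the first ':' is the port separator.
    host, _, port = authority.partition(":")
    return host, (port or None)


if __name__ == "__main__":
    for authority in ("[2001:db8:85a3:8d3:1319:8a2e:370:7348]",
                      "[2001:db8::1]:8080",
                      "example.org:3128"):
        print(authority, split_host_port_naive(authority), split_host_port(authority))
```

With the naive split, the address from the report is cut to "[2001" and the remainder is taken as the port, which matches the truncated URLs seen in the database. The bracket-aware version only looks for a port after the closing "]".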