gigablast / open-source-search-engine

Nov 20 2017 -- A distributed open source search engine and spider/crawler written in C/C++ for Linux on Intel/AMD. From gigablast dot com, which has binaries for download. See the README.md file at the very bottom of this page for instructions.
Apache License 2.0

Doesn't recognize new gTLDs #56

Open isj-privacore opened 9 years ago

isj-privacore commented 9 years ago

Domains.cpp has a hardcoded list of TLDs. The list is incomplete. The new gTLDs include .wiki, .football, .bar, etc. So maintaining a hardcoded list seems a dead end (imho).

gigablast commented 9 years ago

this is true. when it was originally developed, the TLDs were a lot more limited than they are now. so we'll need a fix for this, although i don't see a ton of new TLDs deviating from the original list lately.

On 09/14/2015 08:20 AM, Ivan Skytte Jørgensen wrote:

Domains.cpp has a hardcoded list of TLDs. The list is incomplete. The new gTLDs include .wiki, .football, .bar, etc. So maintaining a hardcoded list seems a dead end (imho).
