commonsearch / cosr-back

Backend of Common Search. Analyses webpages and sends them to the index.
https://about.commonsearch.org
Apache License 2.0

Import Blekko slashtag data #23

Open sylvinus opened 8 years ago

sylvinus commented 8 years ago

As Greg Lindahl (@wumpus) pointed out, DMOZ's data is rather low-quality these days, so it would be great to add presence in https://github.com/blekko/slashtag-data as another signal.

This should be pretty straightforward to do in the code, by duplicating what is currently done with DMOZ.
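A minimal sketch of what such a presence signal could look like, assuming the slashtag data has already been read into a mapping from slashtag name to a list of URL or hostname entries. The function names (`normalize_host`, `build_slashtag_index`, `slashtag_signal`) and the input shape are hypothetical; they are not taken from cosr-back or from the slashtag-data repo, whose actual file layout would need to be checked first.

```python
from urllib.parse import urlsplit


def normalize_host(entry):
    """Reduce a URL or bare hostname to a lowercase host without 'www.'."""
    entry = entry.strip().lower()
    if "://" not in entry:
        entry = "//" + entry  # lets urlsplit treat a bare host as the netloc
    host = urlsplit(entry).hostname or ""
    return host[4:] if host.startswith("www.") else host


def build_slashtag_index(slashtags):
    """Map each host to the set of slashtags that list it.

    `slashtags` is an assumed input shape: {tag_name: [entry, ...]}.
    """
    index = {}
    for tag, entries in slashtags.items():
        for entry in entries:
            host = normalize_host(entry)
            if host:  # skip blank or unparseable entries
                index.setdefault(host, set()).add(tag)
    return index


def slashtag_signal(url, index):
    """Hypothetical signal: 1.0 if the URL's host appears in any slashtag."""
    return 1.0 if normalize_host(url) in index else 0.0


index = build_slashtag_index({
    "/python": ["http://www.python.org/", "docs.python.org/3/"],
    "/news": ["Example.com"],
})
print(slashtag_signal("https://example.com/story", index))
```

Matching on the exact host is the simplest choice here; whether subdomains or path prefixes should also count is left open.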

Is there an explicit license to this data though?

indolering commented 8 years ago

> Is there an explicit license to this data though?

They specifically state that there is no copyright on the data:

> We believe that the data in this repo is not copyrightable. As far as blekko is concerned, you can use it in any way you wish.

indolering commented 8 years ago

You could also ask DDG for their bang data....