openrightsgroup / blocked-org-uk

Template front-end code, markup, style-sheets, images and other assets for the Censorship Monitoring Project (blocked.org.uk)
https://www.blocked.org.uk/
GNU General Public License v3.0
13 stars 5 forks source link

Pull category data from Categorify #342

Closed gwire closed 5 years ago

gwire commented 6 years ago

There's a DNS based web filter service, CleanBrowsing (which launched about 10 months ago), which makes its categorisation information available via an API at Categorify. It's a little light on the actual Terms of Use, but it might be worth taking a look at.

$ curl -s https://categorify.org/api?website=google.com | json_pp -f json -json_opt pretty
{
   "keyword_heatmap" : {
      "policy" : 32,
      "service" : 26,
      "share" : 20,
      "privacy" : 62,
      "services" : 123,
      "information" : 124,
      "example" : 77,
      "content" : 31,
      "account" : 53,
      "using" : 23,
      "play" : 34,
      "collect" : 23,
      "data" : 24,
      "started" : 31,
      "including" : 27,
      "search" : 33,
      "learn" : 71,
      "personal" : 30,
      "store" : 37,
      "google" : 276
   },
   "ip" : "172.217.5.238",
   "rating" : {
      "nudity" : false,
      "description" : "Safe for all audiences.",
      "language" : false,
      "value" : "PG",
      "violence" : false,
      "adult" : false
   },
   "country" : "United States",
   "country-code" : "US",
   "category" : [
      "Search Engine",
      "Clean Browsing"
   ],
   "domain" : "google.com"
}
JimKillock commented 5 years ago

Looks interesting

dantheta commented 5 years ago

Running now for web submissions and requeued tests