censoredplanet / censoredplanet-analysis

Analysis of the CensoredPlanet data.
Apache License 2.0
14 stars 4 forks source link

Filter non-https domains #233

Closed ohnorobo closed 1 year ago

ohnorobo commented 1 year ago

Remove any domains which don't have at least one HTTPS cert match/valid-ca in a given scan.

We don't have to know exactly why any given domain is failing (long dead, bad cert that day, fat-finger DNS, never supported HTTPS, etc) but if we can't confirm one good cert than we have to remove it.

This is possible now because we are also getting the page fetches for control answers here.