etalab / noms-de-domaine-organismes-secteur-public

Liste de noms de domaine d'organismes publics
Other
23 stars 15 forks source link

Tenter une collecte à partir des certificate transparency logs #15

Closed JulienPalard closed 2 years ago

JulienPalard commented 2 years ago

crt.sh expose un postgresql public des Certificate Transparency logs (psql -h crt.sh certwatch guest), en partant de là on peut tenter deux choses :

JulienPalard commented 2 years ago

I worked in #18 with:

SELECT id, x509_notBefore(certificate), x509_altnames(certificate) FROM certificate WHERE plainto_tsquery('gouv.fr') @@ identities(certificate) AND id > 5500000000;

It yields 7k+ domains, up to id 5908982198, but only 1224 were accessible over HTTP.

JulienPalard commented 2 years ago

Did another one, it works well, so I'm closing this issue. I "documented" the procedure in the commit message.

bzg commented 2 years ago

Thanks!