GSA / site-scanning

The central repository for the Site Scanning program
https://digital.gov/site-scanning

Do a fresh analysis of the omb/idea data #940

Open gbinal opened 2 months ago

gbinal commented 2 months ago

Take 4:

Specific sites for the below...

Take 3:

Requests for agencies to improve:

Take 2:

Areas sites could improve:

========

Along the lines of #838. Working here.

Note the original CISA data here.

gbinal commented 2 months ago

For the above analysis, putting this in a different issue.

23 sites refuse the connection (are they blocking us?).

11 have the connection reset. It appears that almost all of these are not live sites.

2 have invalid SSL certificates.

207 have a DNS resolution error. It appears that almost all of these aren't live sites (at the exact target URL; in some cases, it's because www. is required or the like).

107 have unknown errors.

543 time out. A substantial number of these are live sites, but in our experience meta or other client-side redirects (e.g. HTML code that redirects a site, instead of a server-side redirect) are often what our headless browser fails on.
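
One way to check whether a timed-out site is actually a meta-refresh redirect would be to fetch the raw HTML separately and look for a `<meta http-equiv="refresh">` tag. The following is only a rough sketch using the Python standard library; `MetaRefreshParser` and `find_meta_refresh` are hypothetical names for illustration and are not part of the Site Scanning engine. It does not catch JavaScript-based redirects (e.g. `window.location`).

```python
import re
from html.parser import HTMLParser
from typing import Optional


class MetaRefreshParser(HTMLParser):
    """Hypothetical helper: records the target of the first meta-refresh tag."""

    def __init__(self) -> None:
        super().__init__()
        self.target: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        # Only look at <meta> tags, and stop after the first match.
        if tag != "meta" or self.target is not None:
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("http-equiv", "").lower() != "refresh":
            return
        # The content attribute looks like "0; url=https://example.gov/new".
        match = re.search(
            r"url\s*=\s*['\"]?([^'\"]+)", attr.get("content", ""), re.IGNORECASE
        )
        if match:
            self.target = match.group(1).strip()


def find_meta_refresh(html: str) -> Optional[str]:
    """Return the meta-refresh redirect target in the HTML, or None if absent."""
    parser = MetaRefreshParser()
    parser.feed(html)
    return parser.target


if __name__ == "__main__":
    # Assumed sample page, not taken from the scan data.
    page = '<head><meta http-equiv="refresh" content="0; url=https://example.gov/new"></head>'
    print(find_meta_refresh(page))
```

Running something like this over the HTML of the timed-out URLs could give a rough count of how many are client-side redirects rather than dead sites.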