GSA / site-scanning

The central repository for the Site Scanning program
https://digital.gov/site-scanning

Checklist for new fields - a11y #652

Open gbinal opened 12 months ago

gbinal commented 12 months ago

sample data

gbinal commented 12 months ago

The field names and their locations should be:

gbinal commented 11 months ago

Alright - I'm comparing the old and new data here.

Of our 15,104 websites in a current primary snapshot, 11,710 match with an entry from the 12,970 websites in the itdashboard export.

A bigger issue is that 3,416 of the 11,710 matched sites have no scan results in the Site Scanning data because the scan failed. Note, though, that the primary scan completed in each of those cases.

As a next step, we should investigate and try to remedy that high scan failure rate and then compare the data afresh.

Why do they have sites that we don't?
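A rough sketch of the comparison described above, in case it helps reproduce the counts. It assumes each snapshot row carries a `domain` and an `a11y_scan_status` field and that `completed` marks a successful scan. Those names and values are hypothetical placeholders, not confirmed field names from the snapshot or the itdashboard export.

```python
def normalize(domain):
    """Lowercase, trim, and drop a leading 'www.' so both lists compare alike."""
    d = domain.strip().lower()
    return d[4:] if d.startswith("www.") else d


def compare(snapshot_rows, itdashboard_domains):
    # Map each normalized domain to its (hypothetical) a11y scan status.
    status_by_domain = {
        normalize(row["domain"]): row["a11y_scan_status"] for row in snapshot_rows
    }
    ours = set(status_by_domain)
    theirs = {normalize(d) for d in itdashboard_domains}
    overlap = ours & theirs
    # Overlapping sites whose scan did not complete.
    failed = {d for d in overlap if status_by_domain[d] != "completed"}
    return {
        "overlap": len(overlap),
        "only_itdashboard": sorted(theirs - ours),
        "only_site_scanning": sorted(ours - theirs),
        "overlap_failed": len(failed),
    }


# Toy data for illustration only, not real scan results.
rows = [
    {"domain": "www.example.gov", "a11y_scan_status": "completed"},
    {"domain": "Sample.gov", "a11y_scan_status": "timeout"},
    {"domain": "ours-only.gov", "a11y_scan_status": "completed"},
]
result = compare(rows, ["example.gov", "sample.gov", "theirs-only.gov"])
print(result)
```

The `only_itdashboard` list is the place to start when asking why they have sites that we don't. Normalization differences (case, `www.` prefixes) are one possible cause of false mismatches, which is why the sketch normalizes before comparing.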

gbinal commented 10 months ago

Take 2 at comparing the data now that #672 is done...

New scan data | Snapshot of ITDashboard data | Analysis

12,905 in itdashboard, 14,988 from Site Scanning (2,083 more). 2,024 had scans fail.

There are 11,693 that overlap between the two lists.
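For reference, the non-overlapping counts implied by the Take 2 figures can be worked out directly. This is only arithmetic on the numbers above, and it assumes each list contains unique entries and that the overlap is counted once per site.

```python
# Take 2 figures from this comment.
itdashboard = 12_905
site_scanning = 14_988
overlap = 11_693

# Assumes both lists are de-duplicated, so each overlapping site counts once.
only_site_scanning = site_scanning - overlap
only_itdashboard = itdashboard - overlap
size_difference = site_scanning - itdashboard

print(f"only in Site Scanning: {only_site_scanning:,}")  # 3,295
print(f"only in itdashboard:   {only_itdashboard:,}")    # 1,212
print(f"list size difference:  {size_difference:,}")     # 2,083
```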

gbinal commented 10 months ago

Take 3:

New scan data | Snapshot of ITDB | Analysis

12,905 in itdashboard | 15,000 in Site Scanning (457 failed) | 3,366 don't overlap

dup link?