Open gbinal opened 12 months ago
The field names and their locations should be:
Alright - I'm comparing the old and new data here.
Of our 15,104 websites in a current primary snapshot, 11,710 match with an entry from the 12,970 websites in the itdashboard export.
A bigger issue is that 3,416 of the 11,710 have no scan results in the Site Scanning data because the scan failed. Note, though, that the primary scan completed in each of those cases.
As a next step, we should investigate and try to remedy that high scan failure rate and then compare the data afresh.
Why do they have sites that we don't?
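For anyone wanting to reproduce the overlap counts, here is a minimal sketch of the comparison, assuming both exports have been loaded into memory. The `domain` and `scan_status` keys, the `"completed"` status value, and the `normalize` helper are hypothetical names for illustration, not the actual field names in either dataset.

```python
def normalize(domain):
    """Lowercase and trim a domain so trivial formatting differences don't block a match."""
    return domain.strip().lower()


def compare_site_lists(snapshot_rows, itd_domains):
    """Compare a Site Scanning snapshot against a list of itdashboard domains.

    snapshot_rows: iterable of dicts with hypothetical keys 'domain' and 'scan_status'.
    itd_domains:   iterable of domain strings from the itdashboard export.
    """
    snapshot = {normalize(row["domain"]): row for row in snapshot_rows}
    itd = {normalize(d) for d in itd_domains}

    overlap = snapshot.keys() & itd          # sites present in both lists
    only_itd = itd - snapshot.keys()         # sites they have that we don't
    only_snapshot = snapshot.keys() - itd    # sites we have that they don't

    # Of the overlapping sites, which ones have no usable scan result?
    failed = {d for d in overlap if snapshot[d]["scan_status"] != "completed"}

    return {
        "overlap": overlap,
        "only_itd": only_itd,
        "only_snapshot": only_snapshot,
        "failed_in_overlap": failed,
    }
```

The `only_itd` set is the one to inspect when asking why itdashboard lists sites that the snapshot does not.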
Take 2 at comparing the data now that #672 is done...
New scan data | Snapshot of ITDashboard data | Analysis
12,905 in itdashboard, 14,988 from Site Scanning (2,083 more). 2,024 had scans fail.
There are 11,693 that overlap between the two lists.
10,440 match image alt text, 1,253 differ.
8,968 match HTML alt text; 2,725 differ.
9,579 match color contrast; 2,114 differ.
10,440 of 14,988 match image alt text; 1,955 differ; 2,603 N/A (because in SS but not ITD).
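The match / differ / N/A tallies above could be produced with something like the following. This is only a sketch: it assumes each dataset has already been keyed by domain, and the field name passed in (for example `"image_alt_text"`) is a hypothetical placeholder rather than a real column name from either export.

```python
from collections import Counter


def tally_field(ss_by_domain, itd_by_domain, field):
    """Count how one field compares across the two datasets.

    ss_by_domain / itd_by_domain: dicts mapping domain -> row dict.
    Returns a Counter with keys 'match', 'differ', and 'n/a'
    ('n/a' means the site is in Site Scanning but not in itdashboard).
    """
    counts = Counter()
    for domain, ss_row in ss_by_domain.items():
        itd_row = itd_by_domain.get(domain)
        if itd_row is None:
            counts["n/a"] += 1
        elif ss_row.get(field) == itd_row.get(field):
            counts["match"] += 1
        else:
            counts["differ"] += 1
    return counts
```

A handy sanity check is that the three counts sum to the number of Site Scanning rows passed in.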
Take 3:
New scan data | Snapshot of ITDB | Analysis
12,905 in itdashboard | 15,000 in Site Scanning (457 failed) | 3,366 don't overlap
sample data