GSA / site-scanning

The central repository for the Site Scanning program
https://digital.gov/site-scanning
11 stars 2 forks source link

[rewrite this issue or create a new for prototyping] consider a 'Canonicalization'/"naked domain" field #1003

Closed gbinal closed 1 month ago

gbinal commented 1 month ago

Does X URL redirect to www or vice versa, etc?

Note - https://digital.va.gov/web-governance/?_search=naked#awb-oc__377

gbinal commented 1 month ago

When CISA scans x.gov, they scan:

but they talk about the domain just as x.gov.

In any number of websites x.gov or y.z.gov resolves as www.x.gov or www.y.z.gov. But in other cases, www.x.gov or www.y.z.gov resolves as x.gov or y.z.gov. Sometimes, both resolve and don't redirect.

Put another way, for a URL, it can be thought of as:

URL Status
x.gov resolves

is there a good way to represent ^^^ as a data field?

One idea is that all target URLs have their www. removed.

gbinal commented 1 month ago
gbinal commented 1 month ago

If we had this data, we could:

gbinal commented 1 month ago

done - closing this in favor of #1017