GSA / site-scanning

The central repository for the Site Scanning program
https://digital.gov/site-scanning

unable to force certain URLs into index #1113

Closed: gbinal closed this issue 1 week ago

gbinal commented 2 weeks ago

The following 12 URLs should all be on the ignore-except list since they are in the omb-idea list and we need to force them in. Yet I don't see them in the index file.

akadev-ion-hhs.gov
akaprod-foodsafety.gov
akaprod-stopbullying.gov
akaqa-ion-hhs.gov
foiaonline.gov
in.gov
lib-lanl.gov
listserv.sos.wa.gov
origin-acquisition.gov
origin-gsa.gov
saferfederalworkforce.gov
stg-foia.gov

gbinal commented 2 weeks ago

Alright - @luke-at-flexion figured this out for me! The issue is that the filtering stage that removes any URLs that aren't on the .gov or .mil registry list runs after the ignore-excepted list is added in. None of these 12 URLs are on the registry list, so they get filtered back out.
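To make the ordering effect described above concrete, here is a minimal sketch in Python. It is not the project's actual code: the function names, the sample `candidates` and `registry` contents, and the "last two labels" domain matching are all assumptions made for illustration. It only shows why forced URLs disappear when the registry filter runs after they are merged in.

```python
def base_domain(url: str) -> str:
    # Assumption: treat the last two labels as the registered domain.
    # The real pipeline may match against the registry differently.
    return ".".join(url.lower().split(".")[-2:])


def on_registry(url: str, registry: set[str]) -> bool:
    return base_domain(url) in registry


def force_then_filter(candidates, forced, registry):
    # Order described in the comment above: forced URLs are merged in
    # first, then everything not on the registry is removed.
    merged = list(dict.fromkeys([*candidates, *forced]))
    return [url for url in merged if on_registry(url, registry)]


def filter_then_force(candidates, forced, registry):
    # Hypothetical alternative order: filter first, then add forced URLs
    # so they bypass the registry check.
    kept = [url for url in candidates if on_registry(url, registry)]
    return list(dict.fromkeys([*kept, *forced]))


# Hypothetical sample data; only the two forced URLs come from this issue.
candidates = ["gsa.gov", "example.com"]
forced = ["foiaonline.gov", "stg-foia.gov"]
registry = {"gsa.gov"}

current_order = force_then_filter(candidates, forced, registry)
alternative_order = filter_then_force(candidates, forced, registry)

print(current_order)      # forced URLs are dropped by the later filter
print(alternative_order)  # forced URLs survive
```

Whether the second ordering is desirable is a separate question: it would let URLs outside the registry into the index, which may be exactly what the filter is meant to prevent.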

So, maybe this is actually as it should be. Will think on this and maybe not change anything after all.

Thanks, @luke-at-flexion!!!