chainguard-dev / rumble

Data collection for base image CVEs etc.
Apache License 2.0
3 stars 8 forks source link

Filter unwanted images from scan targets #85

Closed luhring closed 1 year ago

luhring commented 1 year ago

Some images shouldn't be scanned because they are no longer being built. Including these images in Rumble's output distorts downstream metrics and adds noise to the dataset.

Today, Rumble derives its list of images to scan using a chainctl command. This command lets us dynamically look up images of interest; however, today it's also returning a few of the problematic images. We hope to do some related cleanup work that will result in chainctl not returning the problematic images.

Until that's ready, this PR cleans up the mess on the Rumble side and downstream of Rumble's processing. Once the upstream cleanup is finished, we can revert this PR!

cc: @jspeed-meyers @jdolitsky @imjasonh @mattmoor