Open z0ph opened 1 year ago
@z0ph Thanks for proposing this!
What action would you imagine us taking when a link is dead?
I don't think we'd want to "age out" accounts, so the link dying likely doesn't change validity of inclusion If we always want a valid link, I would generally prefer proactive automation (for example, archive.org'ing any link that is submitted) vs. reactive
Hey Rami,
Thanks for reviewing my PR.
My goal was to help maintainers of this repository/initiative with awareness of dead links that reference a specific AccountId.
We could probably construct an archive.org link to replace dead links on the fly. Let us decide this collectively. But at least we are now aware that some links are not working anymore.
I do not want to remove items because links no longer work, as folks probably still have trust policies in their environment for the related account IDs, and the purpose of this repo is to let people know what those are. There may be a need for us to identify some accounts as known but no longer valid
or various other categories (ex. known malicious
), but for now, I think we should just leave these. Some accounts may have new references, while others may be dead (ex. no one should have a trust relationship from my old Summit Route consulting business anymore). Mostly the links have been to ensure that at some point in time there was an admission by a company that they owned the account ID. As account IDs are not re-used, there shouldn't be any reason not to leave these. So for now, I think we should just ignore these.
@0xdabbad00 a few of the entries that you removed from the Permiso data-set had links, but those links no longer have the account ID. At one point they did, otherwise the link wouldn't have been in the original data-set. Do we want to include a crawled-at time indicating the time when the URL did have the account ID information?
Add tooling to identify dead links - First output: