internetarchive / wayback

IA's public Wayback Machine (moved from SourceForge)
742 stars 134 forks source link

List sites that are excluded from Wayback Machine #228

Open bruceleerabbit opened 3 years ago

bruceleerabbit commented 3 years ago

Sorry.  This URL has been excluded from the Wayback Machine

Sites like Quora have excluded themselves from wayback machine. And worse, some of those sites block direct access also (Quora blocks all Tor traffic). This means users are lead on a chase for an unreachable document.

The fix: Please make the list of hosts that are excluded from the Wayback Machine (WBM) easily accessible. This would enable search engines to exclude such time-wasting sites from search results, or to at least give them a lower ranking. It would also make it possible to create browser plugins that will flag such sites in search results of mainstream search engines, so that users know to avoid them.

Excluded websites also create a problem for bloggers and journalists. We take the time to write a well-cited article, and then a link goes dead. The remedy is to substitute the dead link with a WBM-mirror, which could even be automated. But when WBM is blocked, there's no recourse. Part of our article loses credibility or becomes useless. We would rather not spend the time drafting an article that relies on such unreliable sources.

Note this enhancement is similar but complemtary to this bug: https://github.com/internetarchive/wayback/issues/199

Satoshi0x commented 1 year ago

Use ardrive.io and the ardrive chrome extension to archive each of your blog posts you want saved :) The future is here and its cheap bc they faucet you tokens that last for like thousands of pages.

https://a4kajdbiof5xztkgdyydsu2ibdixdf2nf4zscoa7nvrt6dzavgoq.arweave.net/BxQEjChxe3zNRh4wOVNICNFxl00vMyE4H21jPw8gqZ0

Or... even do better and use Keybase's KBFS (KeyBaseFIleSystem) download the index.html from archive.ph where they had the blog post that was from 2014 and showing the same message on archive.org and you can have it look like this on your own immutable E2E encrypted front end for the index.html you store in your filebase. If you have windows you just run keybase app, go to file explorer you'll see a (K:) drive for KBFS add the index.html you downloaded and then go to https://satoshi0x.keybase.pub/ that is [yourusername].keybase.pub/ to view the perma stored KBFS front end of that blog. :)