-
I'm trying to see if I can integrate `savepagenow` into my election night scraping system. The idea would be to save online results files into the Wayback Machine when my system detects the results ha…
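A minimal sketch of the "save only when the results changed" idea, assuming the scraper already has the fetched bytes in hand. The names `content_fingerprint`, `archive_if_changed`, and the in-memory `seen` dict are hypothetical, not part of `savepagenow`. The archiving call is passed in as a parameter so the logic can be exercised without touching the network.

```python
import hashlib
from typing import Callable, Dict, Optional


def content_fingerprint(body: bytes) -> str:
    """Return a SHA-256 hex digest of a results file's bytes."""
    return hashlib.sha256(body).hexdigest()


def archive_if_changed(
    url: str,
    body: bytes,
    seen: Dict[str, str],
    archive: Callable[[str], str],
) -> Optional[str]:
    """Request a capture of `url` only when `body` differs from the last one seen.

    `seen` maps URL -> last fingerprint (hypothetical in-memory state).
    `archive` is any callable that takes a URL and returns an archive URL.
    Returns the archive URL when a capture was requested, otherwise None.
    """
    fingerprint = content_fingerprint(body)
    if seen.get(url) == fingerprint:
        return None  # unchanged since the last check, so skip the capture
    archived_url = archive(url)
    # Record the fingerprint only after the capture call returned, so a
    # capture that raised is retried on the next poll.
    seen[url] = fingerprint
    return archived_url
```

In a real run, `savepagenow.capture` could plausibly be passed as the `archive` argument, since it takes a URL and returns the Wayback Machine URL of the capture. Rate limits and error handling are left out of this sketch.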
-
The Internet Archive import script(s) (`wm import ia` and `wm import ia-known-pages`) should have an option that causes them to upload Mementos to S3:
```sh
$ wm import ia 'http://www.epa.gov/' --…
```
-
This is a new feature idea to make use of the Wayback Machine's Changes API.
The idea is to compare the latest saved snapshot of the currently viewed website to a snapshot that was stored in the p…
-
Old news: LR35902 is the part number of the Game Boy system on chip.
New news: LR359 prefix appears in the SoC part numbers of the whole line of handhelds, Game Boy through Nintendo 3DS. Relevant par…
-
It no longer works. Every time I try (for any site), I receive the same error:
_C:/Ruby31-x64/lib/ruby/3.1.0/net/http.rb:1018:in `initialize': Nenhuma conexão pôde ser feita porque a máquina de destino …
-
### The Problem
The idea for this feature stems from noticing that the Wayback Machine can't always save pages due to measures that websites take to block web crawlers, but which the user is able t…
-
Thanks, archive.org, for solving a big part of my digital-media management issues!
- [Fix You in memory of Aaron Swartz : K, K, and B : Free Download, Borrow, and Streaming : Internet Archive](htt…
-
I used this great tool to download the site http://web.archive.org/web/20230713110210/http://users.tpg.com.au/jpwbeest/. At first glance everything went well, but then I found out that some downloaded…
-
## Project description
The Wayback Machine is a great resource, but sometimes it doesn't have a complete archive of a website, and it doesn't crawl all those little websites where some gems might be hidden,…
-
I’ve been digging into the link checker on [ma.tt](http://ma.tt/). Here are some quick thoughts:
1. We need to ignore share links. There are like 7 of them on each page and they all block robots, …