-
An engine that interfaces with the Internet Archive's search API could be useful as a corner case, i.e., with shortcuts only to search for historical sites.
-
URL's should be preserved as soon as they are found.
Reference code:
**[pragma.archivelab.org/pragma/api/pragmas.py](https://github.com/ArchiveLabs/pragma.archivelab.org/blob/7587f6de4cd380bfb10…
-
I’m working on some tools for using the [Internet Archive’s Wayback](https://web.archive.org) Machine APIs and it’d be really helpful if I could write a custom `Retry` class that can look at the the r…
-
ClearURLs are a set of rules to strip URLs of tracking elements from known websites. This would help protect the user's privacy.
We'd like to add this to General Settings ("Use Clean URLs") which w…
-
I am trying to recover a website that was created for a non-profit organization with WordPress. It was hosted on a third-party site but the organization has lost its admin access and somehow broke the…
-
### Description
The normal getter for segments hides those which are voted negativ etc.
I wanted a method to get all segments of a video, regardless of votes.
This could be achieved by adding a s…
-
Does this still work?
https://github.com/crimethinc/website/issues/451
Let's find out and either close this with no additional effort needed or let's fix the thing so that archive.org always has…
-
Hi,
I am getting the following error both from python and cli usage. archive.cli is working fine though.
Error (The Internet Archive): 502 Server Error: Bad Gateway for url: https://web.archive.…
-
So I know many people have trouble due to the fact that MyAnimeLIst no longer whitelists new IP addresses from their antiDDOS software which leads to many people struggling to scrape data of the websi…
-
Just asking. Examples: TikTok, Twitch, Vimeo, Dailymotion, Twitter, Reddit, etc.