-
When trying to view https://arquivo.pt/wayback/20220916102735/https://www.publico.pt/ , after a while the replay breaks:
![image](https://user-images.githubusercontent.com/79858044/193834236-68d19a…
-
**Is your question related to a problem or code? Please describe.**
My issue is that after reading in the changelog that you can now archive (and preserve) non-HTML content, I have added a PDF link w…
-
I can archive a page manually by clicking "Archive Current Page", I can see it in my archivebox instance, but it does not send archive everything I visit automatically. I see no entries of visited web…
-
## Context
A prototype, we are making automatic archives via internet archive in this repository.
## Long term strategy
For long-term stability, we should use our own archival solution, distr…
-
Hello,
I am Nurullah from [HTTP Archive](https://github.com/HTTPArchive), and we are planning to use Topics API model to categorize webpages for [the 2024 Web Almanac ](https://github.com/HTTPArchi…
nrllh updated
1 month ago
-
is it possible to be able to get this in Zim file format to use with https://kiwix.org/en/
this is an ofline internet project which enable for the creation of zim files an archive which can be browse…
-
The wiki links to old dansguardian documentation appears to be broken. The webpages appear not to exist any longer. The links should be replaced with links to archived dansguardian documentation or to…
-
OSF is seeking to provide an example of best-practices defined by the COL. One of those best-practices is "archving".
* [ ] What does COL archiving entail?
The two targets for archiving I fee…
-
On content-rich webpages the algorithm does not seem to terminate, leading to a deadlock which has to be interrupted. See adbar/trafilatura#189
Here is an archived version of the page where the pro…
adbar updated
2 years ago
-
This is specifically necessary when many webpages are archived.