Mitchellpkt / auto_repo_archiver

Uses the Wayback Machine to archive GitHub repos mentioned in arXiv writeups
0 stars 0 forks source link

auto_repo_archiver

!! Work in progress !!

Given a keyword,

(1) pulls down related arXiv papers

(2) scans each page of each paper for GitHub repository URLs

(3) triggers the Wayback Machine to archive the repository, IF AND ONLY IF it has not already been archived

This is a hacky prototype, but I think the core functionality works. (note that at this time it only snapshots the repo homepage)

image