remotehack / remotehack.github.io

https://remotehack.space
MIT License
7 stars 3 forks source link

Create a bot to scan github readmes for http links and fix them #90

Open dev-earthhero opened 4 years ago

dev-earthhero commented 4 years ago

Http links are unnecessary in this modern world of encryption standards, and if https links work, there's no reason to use http.

In some cases, however, sites have yet to upgrade to https (wait til they hear about http/3!!!), so we dont necessarily want to blanket find-replace on match.

Ideally we would find all links, confirm via an http GET that the https version has a 200 status, then proceed with the replace.

Python would be super easy to do this with, but bonus points for golang!!!

NOTE: this doesn't specify the data source, so either it just crawls a bunch of things, or it's a single fire process that looks at one specific reader.

lpmi-13 commented 4 years ago

once this is up, point it at the github repo for https://emscripten.org/ because there are 2,000+ http links in those docs. YIKES!!!