markodenic / awesome-tech-blogs

A list of Tech Blogs.
805 stars 414 forks source link

Dead links #463

Open jcubic opened 6 days ago

jcubic commented 6 days ago

I've noticed one or two dead links. One pointed to (via two redirects to betting site) and the other to the marketplace to buy a domain.

If you are willing to merge my PR and I see if I can create GitHub workflow that will scan each link (in cron every week) if the field name appear on the blog.

I write the script locally using Puppeteer to scrap the text from the page and see if all URL have the somewhere on the page the title.

I think that deleting the URLs shoud be done by hand so I can show the list of invalid URLs that can be investigated by hand.

What do you think?

Ryuno-Ki commented 5 days ago

I wonder whether Puppeteer is needed here (instead of cURL + htmlq for example).

jcubic commented 5 days ago

A lot of people create websites in React that doesn't work without JavaScript.