Open gwern opened 8 years ago
Thank you for the issue report. Sadly, this project is dead, and a new team has taken over at https://github.com/linkcheck/linkchecker. For more details, please see #708. Please also close this issue and report it afresh on the new repo, https://github.com/linkcheck/linkchecker/issues, if your issue still persists.
In a recent mailing list entry which included the URL http://www.gwern.net//Longevity#metformin (note the double-slash //) where I had meant to write http://www.gwern.net/Longevity#metformin, linkchecker failed to flag the broken URL during several checks I had made of the draft. The double-slash URL is wrong and leads to a 404 error when I check in Chromium, Firefox, elinks, wget, and curl. However, linkchecker does not flag an error when it is asked to check a file with that URL linked.
Here is an example input:
This should yield 1 valid link and 1 error. However, linkchecker believes it yields 2 valid links and 0 errors:
More detailed output:
My guess is that maybe it's internally rewriting double-slashes to single-slashes to get a valid URL, thus seeing only 1 link to check (the valid one), even though this means that it hides the existence of links broken in all browsers I could check.
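To make that guess concrete, here is a minimal Python sketch of the kind of normalization that would produce this behavior. It is not taken from linkchecker's source: the function name collapse_slashes and the regex-based rewrite are hypothetical, and linkchecker's real normalization may work differently.

```python
import re
from urllib.parse import urlsplit, urlunsplit


def collapse_slashes(url):
    """Hypothetical normalization: collapse runs of slashes in the path.

    This is an assumption about what a link checker might do internally,
    not linkchecker's actual code.
    """
    parts = urlsplit(url)
    # Only the path component is touched; the "//" after the scheme is
    # part of the URL syntax and is left alone by urlsplit/urlunsplit.
    path = re.sub(r"/{2,}", "/", parts.path)
    return urlunsplit(
        (parts.scheme, parts.netloc, path, parts.query, parts.fragment)
    )


urls = [
    "http://www.gwern.net//Longevity#metformin",  # broken (404 in browsers)
    "http://www.gwern.net/Longevity#metformin",   # intended, valid
]

normalized = [collapse_slashes(u) for u in urls]
unique = set(normalized)

# Both inputs map to the same normalized URL, so a checker that
# de-duplicates after normalizing would only ever request the valid one.
print(len(unique))
```

If something like this runs before the request is sent, the broken URL is never fetched as written, which would explain a report of 0 errors.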