pdehaan closed this issue 3 years ago
The current broken links might be coming from the <head> metadata (for the "canonical" and "og:url" hrefs):
<link rel="canonical" href="https://github.com/mozilla/experimenter-docs/experimenter-docs/" data-react-helmet="true">
<meta property="og:url" content="https://github.com/mozilla/experimenter-docs/experimenter-docs/" data-react-helmet="true">
Note the duplicated "/experimenter-docs/" in the URL. Also, I'm not sure either of those is the canonical URL (vs. the desired https://mozilla.github.io/experimenter-docs/ href, but 🤷).
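For what it's worth, a doubled path segment like that is easy to flag mechanically. Here is a minimal sketch in Python (stdlib only); the function name `duplicated_segments` is made up for illustration and isn't part of any tool mentioned in this thread:

```python
from urllib.parse import urlsplit


def duplicated_segments(url: str) -> list:
    """Return path segments that appear twice in a row in the URL's path."""
    # Split the path on "/" and drop empty pieces from leading/trailing slashes.
    segments = [s for s in urlsplit(url).path.split("/") if s]
    # Compare each segment with the one that follows it.
    return [a for a, b in zip(segments, segments[1:]) if a == b]


if __name__ == "__main__":
    bad = "https://github.com/mozilla/experimenter-docs/experimenter-docs/"
    good = "https://mozilla.github.io/experimenter-docs/"
    print(duplicated_segments(bad))
    print(duplicated_segments(good))
```

This only catches back-to-back repeats, which is the pattern in the two hrefs above; it says nothing about whether the host itself is the right one.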
OK, possibly final one...
I have mixed feelings about this one, but we can try using markdown-link-check. It doesn't handle file globs, so we'll need to use the API and roll our own logic. But meh.
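To give a rough idea of what "roll our own logic" could look like, here is a sketch in Python rather than against the markdown-link-check API itself (whose exact call shape I'm not asserting here). It expands a glob, reads each .md file, and pulls out inline link targets. The names `extract_links` and `links_by_file`, the `**/*.md` pattern, and the simplified regex are all assumptions for illustration:

```python
import re
from pathlib import Path

# Matches inline Markdown links such as [text](https://example.com).
# Simplified on purpose: it also matches image links (![alt](src)) and
# ignores reference-style links and bare URLs.
LINK_RE = re.compile(r"\[[^\]]*\]\(([^)\s]+)")


def extract_links(markdown: str) -> list:
    """Return the link targets found in a Markdown string, in order."""
    return LINK_RE.findall(markdown)


def links_by_file(root: str, pattern: str = "**/*.md") -> dict:
    """Map each Markdown file under `root` to the link targets it contains."""
    results = {}
    for path in sorted(Path(root).glob(pattern)):
        results[str(path)] = extract_links(path.read_text(encoding="utf-8"))
    return results
```

The per-file mapping would then be handed to whatever actually checks the URLs; that part is deliberately left out of the sketch.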
We might be able to find a different link checker tool that will scrape an entire HTML site vs the input .md files.
For example, broken-link-checker will check a whole site, but fails on the RemoteSettings pages since they require VPN access.
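One way to keep VPN-only pages from failing a run is to filter URLs by host before checking them. A minimal sketch in Python follows; `is_excluded` is a hypothetical helper, and the host `vpn-only.example.com` is a placeholder, not the real RemoteSettings domain:

```python
from fnmatch import fnmatch
from urllib.parse import urlsplit


def is_excluded(url: str, patterns: list) -> bool:
    """Return True if the URL's hostname matches any shell-style pattern."""
    # hostname is None for relative links, so fall back to an empty string.
    host = urlsplit(url).hostname or ""
    return any(fnmatch(host, pattern) for pattern in patterns)


if __name__ == "__main__":
    # Placeholder pattern; substitute the hosts that need VPN access.
    ignore = ["*.example.com"]
    for url in ("https://vpn-only.example.com/page",
                "https://mozilla.github.io/experimenter-docs/"):
        print(url, "skipped" if is_excluded(url, ignore) else "checked")
```

Matching on the hostname rather than the full URL keeps the ignore list short, at the cost of not being able to skip a single path on an otherwise reachable host.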
I'm looking through the README, but it still isn't clear whether it's possible to add a list of domains/URLs to ignore. (It's the --exclude flag.)
┆Issue is synchronized with this Jira Task ┆Issue Number: EXP-1084