wummel / linkchecker

check links in web documents or full websites
http://wummel.github.io/linkchecker/
GNU General Public License v2.0
1.42k stars 234 forks source link

Checks external links? #697

Open zguillen opened 7 years ago

zguillen commented 7 years ago

I don't see a config option to set to enable linkchecker to get http status codes from external links in order to find links to others' websites that are now broken (I.E. returns 404, or a 500). I see in the desktop GUI's result column a lot of "Valid: filtered" messages with the "Info" property saying, "The URL is outside of the domain filter, checked only syntax." So is there a way to only crawl MY website's pages using the domain filter, but to also check URL's outside of the domain filter not only for syntax but also that they return an actual web page and not an HTTP error?

Is this possible?

peterjacobs commented 7 years ago

I use it for that use case too. The documentation is confusing and out of date There were also some changes lately related to this area I think.

Which version of linkchecker do you use?

does it work when you add the commandline option: --check-extern

dpalic commented 6 years ago

Thank you for the issue report. Sadly this project is dead, and a new team is around with https://github.com/linkcheck/linkchecker for more details please see: #708 Also please close this issue and report it freshly on the new repo https://github.com/linkcheck/linkchecker/issues