wummel / linkchecker

check links in web documents or full websites
http://wummel.github.io/linkchecker/
GNU General Public License v2.0

Strange escaping of URL generates a 400 BAD REQUEST error #716

Open oupala opened 7 years ago

oupala commented 7 years ago

One of the URLs I'm trying to check with linkchecker is:

http://webarchive.nationalarchives.gov.uk/20130129110402/http://www.hm-treasury.gov.uk/sternreview_translations.htm

linkchecker is returning an error:

Result Error: 400 BAD REQUEST

The URL it actually requested was:

http://webarchive.nationalarchives.gov.uk/20130129110402/http%3A/www.hm-treasury.gov.uk/sternreview_translations.htm

As you can see, the http:// in the middle of the URL has been translated to http%3A/: the colon has been percent-encoded and the double slash has been collapsed into a single slash.

I think that might explain the error!
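
To illustrate, here is a minimal Python sketch of how this kind of rewriting could arise. It is not linkchecker's actual code: `naive_normalize` and `gentler_normalize` are hypothetical helpers, and the assumption is that the path is normalized by collapsing repeated slashes and then percent-encoding everything except `/`. Under that assumption, the sketch reproduces the requested URL shown above.

```python
import re
from urllib.parse import quote, urlsplit, urlunsplit

ORIGINAL = (
    "http://webarchive.nationalarchives.gov.uk/20130129110402/"
    "http://www.hm-treasury.gov.uk/sternreview_translations.htm"
)


def naive_normalize(url):
    """Hypothetical normalizer (an assumption, not linkchecker's code).

    Collapses runs of slashes in the path and percent-encodes every
    character that is not unreserved or "/", which includes ":".
    """
    parts = urlsplit(url)
    path = re.sub(r"/{2,}", "/", parts.path)  # "http://" -> "http:/"
    path = quote(path, safe="/")              # ":" -> "%3A"
    return urlunsplit(
        (parts.scheme, parts.netloc, path, parts.query, parts.fragment)
    )


def gentler_normalize(url):
    """Hypothetical alternative: keep ":" and leave slashes untouched."""
    parts = urlsplit(url)
    path = quote(parts.path, safe="/:")
    return urlunsplit(
        (parts.scheme, parts.netloc, path, parts.query, parts.fragment)
    )


if __name__ == "__main__":
    print(naive_normalize(ORIGINAL))
    print(gentler_normalize(ORIGINAL))
```

With the first helper the embedded `http://` becomes `http%3A/`, while the second leaves the URL unchanged. A colon is a legal character inside a URL path, so escaping it is not required here, although whether the archive server would also accept the collapsed slash is something I have not checked.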

dpalic commented 7 years ago

This project is dead; please see: https://github.com/wummel/linkchecker/issues/708

oupala commented 7 years ago

Unfortunately, the repository owner didn't add a warning in the readme file.

The unfortunate part is that you created the new repo without forking from the old one, so there is no logical link between the original repo and your new repo.

Moreover, a Google search for linkchecker returns Wummel's repo first, while your repo does not appear in the first ten pages of results.

Fortunately, you sometimes visit the old repo to warn people that it has been abandoned. Thanks @dpalic for the tip.

dpalic commented 6 years ago

Thank you for the issue report. Sadly, this project is dead, and a new team is now working at https://github.com/linkcheck/linkchecker. For more details, please see #708. Also, please close this issue and report it afresh on the new repo: https://github.com/linkcheck/linkchecker/issues