Open jimpriest opened 9 years ago
Hi Jim, just to be sure, are we talking about URLs such as http://[http//w.on24.com/r.htm?e=991027&s=1&k=DBEA8D7CD7CF38AE3A007AB5432DAC2B&partnerref=sapredhat
found on this page: http://verticalindustriesblog.redhat.com/tune-in-red-hat-sap-and-tabb-group-discuss-high-performance-computing-its-growth-in-financial-services-and-its-shrinking-cost/
Trying this link on firefox raises a Server Not Found error (not even a 404). I guess, pylinkvalidator should report a sensible error when the URL is not parsable. Just want to make sure I'm not missing other cases.
I think what is happening is the WYSIWG editor is trying to fix these links by adding an extra http// in the mix. Not sure where the bracket is coming from but they aren't IPv6 urls.
So yes, I think maybe just a more generic 'unparsable url found' error message may be more useful?
When checking some URLs I get the following error:
Even though the URL is is not formatted unusually.
Scan http://verticalindustriesblog.redhat.com/ with depth=1 for some examples.
I may modify my fork to just ignore this error but I'm not sure there is a correct way to 'fix' it. Googling it seems like an issue with Python 2.7.x.
I see it both on 2.7.5 and 2.7.10.