wummel / linkchecker

check links in web documents or full websites
http://wummel.github.io/linkchecker/
GNU General Public License v2.0
1.42k stars 234 forks source link

Applet codebase incorrectly handled #22

Closed wummel closed 11 years ago

wummel commented 11 years ago

Converted from SourceForge issue 636802, submitted by majid

I have the following code on one of my pages (http://www.majid.info/radio/): <applet code=panoapplet script language="JavaScript" codebase="http://www.majid.info/images/&quot; height=266 name=FPViewer1 width=400 archive="panoapplet.jar"> <param name=file value="http://www.majid.info/images/louvre.ivr&quot;&gt; <param name="autoSpin" value="-50"> </applet>

And linkchecker reports an error because it tries to load http://www.majid.info/radio/panoapplet.jar instead of codebase + archive = http://www.majid.info/images/panoapplet.jar

There seems to be some minimal support for codebase in UrlData.py, but not as far as prepending it intelligently to the URL the way <A BASE> is handled.

URL panoapplet.jar Parent URL http://www.majid.info/radio/, line 46 Real URL http://www.majid.info/radio/panoapplet.jar Check Time 0.051 seconds Result Error: 404 Not Found

wummel commented 11 years ago

Submitted by calvin

Logged In: YES user_id=9205

Yup, I am working on this. A codebase is then valid for 'archive' and 'src' links, overriding a <base> tag. The codebase link itself wont be checked anymore, as its not supposed to be a working link, only a prefix.