Closed Marooned-MB closed 9 years ago
Hmm, I just found out that this is a duplicate of #46, but that issue is closed even though the problem still exists.
OK, all is clear. I got v0.1.4 from http://nrabinowitz.github.io/pjscrape/, which is outdated. Sorry for the confusion :)
Links of the form `//domain.tld/` are a common way to handle both the http and https protocols. They should be expanded to `http://domain.tld/` on an http site or `https://domain.tld/` on an https site. `_pjs.toFullUrl()` expands such links into `base//domain.tld/`, which is of course wrong.

A simple real-world example: http://en.wikipedia.org/wiki/A. Check the footer for `//wikimediafoundation.org/` and `//www.mediawiki.org/`, which are expanded to `http://en.wikipedia.org//wikimediafoundation.org/` and `http://en.wikipedia.org//www.mediawiki.org/`.
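For illustration, the expected expansion could be sketched roughly as follows. This is not pjscrape's actual `_pjs.toFullUrl()` code; the function name `expandUrl` and its parameters are hypothetical, and the sketch assumes the page URL is absolute and skips `../` normalization.

```javascript
// Hypothetical helper for illustration only; not pjscrape's implementation.
// Expands a link found on a page into a full URL, assuming pageUrl is absolute.
function expandUrl(href, pageUrl) {
    // Capture the protocol ("http:") and the origin ("http://host") of the page.
    var match = /^([a-z][a-z0-9+.-]*:)\/\/[^\/?#]+/i.exec(pageUrl);
    if (!match) {
        throw new Error('pageUrl must be absolute: ' + pageUrl);
    }
    var protocol = match[1];
    var origin = match[0];

    // Link already carries its own scheme: return it unchanged.
    if (/^[a-z][a-z0-9+.-]*:/i.test(href)) {
        return href;
    }
    // Protocol-relative link: inherit only the protocol from the page.
    if (href.indexOf('//') === 0) {
        return protocol + href;
    }
    // Root-relative link: prepend the page's origin.
    if (href.charAt(0) === '/') {
        return origin + href;
    }
    // Path-relative link: resolve against the page's directory
    // (no "../" or "./" normalization in this sketch).
    var path = pageUrl.slice(origin.length).split(/[?#]/)[0];
    var dir = path.slice(0, path.lastIndexOf('/') + 1) || '/';
    return origin + dir + href;
}

console.log(expandUrl('//wikimediafoundation.org/', 'http://en.wikipedia.org/wiki/A'));
// prints "http://wikimediafoundation.org/"
console.log(expandUrl('//www.mediawiki.org/', 'https://en.wikipedia.org/wiki/A'));
// prints "https://www.mediawiki.org/"
```

The key point is that the `//` check has to come before the single `/` check; otherwise a protocol-relative link is treated as root-relative and gets the page's host prepended, which is the behavior described above.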